Clustering-based Graph Numbering using Execution Traces for Cache Misses Reduction in Graph Analysis Applications

Régis Audran Mogo Wafo; Thomas Messi Nguélé; Xaviera Youh Djam

Pré-Publication, Document De Travail Année : 2022

Clustering-based Graph Numbering using Execution Traces for Cache Misses Reduction in Graph Analysis Applications

, (1) ,

Régis Audran Mogo Wafo

Fonction : Auteur
PersonId : 1136072

Thomas Messi Nguélé

Fonction : Auteur
PersonId : 8299
IdHAL : messi

Laboratoire d'Informatique de Grenoble

Xaviera Youh Djam

Fonction : Auteur

Résumé

Social graph analysis is generally based on a local exploration of the underlying graph. That is, the analysis of a node of the graph is often done after having analyzed nodes located in its vicinity. However, over the time, networks are bound to grow with the addition of new members, which inevitably leads to the enlargement of the corresponding graphs. At this level we therefore have a problem because more the size of the graph increases, more the execution time of graph analysis applications too. This is due to the very large number of nodes that will need to be treated. Some recent work in-faces this problem by exploiting the properties of social networks such as the community structure to renumber the nodes of the graph in order to reduce cache misses. Reducing cache misses in an application allows to reduce the execution time of this application. In this paper, we argue that combining existing graph ordering with a new numbering that exploit execution traces analysis can allow to improve cache misses reduction and hence execution time reduction. The idea is to build graph numbering using execution traces of graph analysis applications and then combine it with an existing graph numbering (such as cn-order). To build this new ordering, we define a new distance and then used it to analyse execution traces with well known clustering algorithms K-means (for Kmeans-order) and hierarchical clustering (for cl-hier-order). Experiments on a user machine (dual-core) and four cores of Grid'5000 node (Neowise) show that this combination improves slightly existing graph ordering (cn-order, numbaco, rabbit and gorder) in almost all the cases (the two cores of dual-core, all the four cores of neowise), with PageRank graph application and astro-ph dataset. For example, on neowise with one thread and Astro-ph dataset, the best performance is given with the combination kmeans-order_cn-order which allows to reduce by 42.59% the cache misses (compared to the second numbaco with 40.79%) and therefore by 7.27 % the time of execution (compared to 6.89% for the second numbaco).

Mots clés

Graph analysis Execution trace Machine learning Cache misses reduction

Domaines

Informatique [cs]

Fichier principal

Clustering-based_Graph_Numbering.pdf (271.15 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Thomas Messi Nguélé : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03687348

Soumis le : vendredi 3 juin 2022-11:43:40

Dernière modification le : jeudi 4 avril 2024-21:04:17

Archivage à long terme le : dimanche 4 septembre 2022-18:46:54

Dates et versions

hal-03687348 , version 1 (03-06-2022)

Identifiants

HAL Id : hal-03687348 , version 1

Citer

Régis Audran Mogo Wafo, Thomas Messi Nguélé, Xaviera Youh Djam. Clustering-based Graph Numbering using Execution Traces for Cache Misses Reduction in Graph Analysis Applications. 2022. ⟨hal-03687348⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS LIG GRID5000 SILECS LIG_SIDCH

42 Consultations

22 Téléchargements

Clustering-based Graph Numbering using Execution Traces for Cache Misses Reduction in Graph Analysis Applications

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager