Clustering-based Graph Numbering using Execution Traces for Cache Misses Reduction in Graph Analysis Applications - Archive ouverte HAL Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2022

Clustering-based Graph Numbering using Execution Traces for Cache Misses Reduction in Graph Analysis Applications

Régis Audran Mogo Wafo
  • Fonction : Auteur
  • PersonId : 1136072
Thomas Messi Nguélé
  • Fonction : Auteur
  • PersonId : 8299
  • IdHAL : messi
Xaviera Youh Djam
  • Fonction : Auteur

Résumé

Social graph analysis is generally based on a local exploration of the underlying graph. That is, the analysis of a node of the graph is often done after having analyzed nodes located in its vicinity. However, over the time, networks are bound to grow with the addition of new members, which inevitably leads to the enlargement of the corresponding graphs. At this level we therefore have a problem because more the size of the graph increases, more the execution time of graph analysis applications too. This is due to the very large number of nodes that will need to be treated. Some recent work in-faces this problem by exploiting the properties of social networks such as the community structure to renumber the nodes of the graph in order to reduce cache misses. Reducing cache misses in an application allows to reduce the execution time of this application. In this paper, we argue that combining existing graph ordering with a new numbering that exploit execution traces analysis can allow to improve cache misses reduction and hence execution time reduction. The idea is to build graph numbering using execution traces of graph analysis applications and then combine it with an existing graph numbering (such as cn-order). To build this new ordering, we define a new distance and then used it to analyse execution traces with well known clustering algorithms K-means (for Kmeans-order) and hierarchical clustering (for cl-hier-order). Experiments on a user machine (dual-core) and four cores of Grid'5000 node (Neowise) show that this combination improves slightly existing graph ordering (cn-order, numbaco, rabbit and gorder) in almost all the cases (the two cores of dual-core, all the four cores of neowise), with PageRank graph application and astro-ph dataset. For example, on neowise with one thread and Astro-ph dataset, the best performance is given with the combination kmeans-order_cn-order which allows to reduce by 42.59% the cache misses (compared to the second numbaco with 40.79%) and therefore by 7.27 % the time of execution (compared to 6.89% for the second numbaco).
Fichier principal
Vignette du fichier
Clustering-based_Graph_Numbering.pdf (271.15 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03687348 , version 1 (03-06-2022)

Identifiants

  • HAL Id : hal-03687348 , version 1

Citer

Régis Audran Mogo Wafo, Thomas Messi Nguélé, Xaviera Youh Djam. Clustering-based Graph Numbering using Execution Traces for Cache Misses Reduction in Graph Analysis Applications. 2022. ⟨hal-03687348⟩
42 Consultations
22 Téléchargements

Partager

Gmail Facebook X LinkedIn More