Assessing the Quality of Multilevel Graph Clustering - Archive ouverte HAL Accéder directement au contenu
Rapport Année : 2011

Assessing the Quality of Multilevel Graph Clustering

Résumé

Hierarchical clustering of graphs is a useful strategy to mine, explore and visualize graphs. Popular approaches define ad hoc procedures to decide how subgraphs are subdivided or nested. The popularity of graph hierarchies certainly relates to the relevance of multilevel models appearing in the natural and social sciences. For instance, current models in biology (genomics and/or proteomics) try to capture the multilevel nature of networks formed by various biological entities; cities and worldwide city systems in geography can also be described as multilevel networks. In our opinion, a theory supporting these multilevel clustering approaches is yet to be developed. Indeed, to the best of our knowledge there are no known optimization multilevel criteria guiding the construction of a hierarchy of clusters: the hierarchy basically is an artefact of an iterative procedure. The main results of this paper contribute to such a multilevel clustering theory, by designing and studying a multilevel modularity measure for hierarchically clustered graphs, explicitly taking the nesting structure of clusters into account. The multilevel modularity we propose generalizes a modularity measure introduced by Mancoridis et al. in the context of reverse software engineering. The measure we designed recursively traverses the hierarchy of clusters and computes a one-variable polynomial encoding the intra and inter-cluster densities appearing at all levels in a hierarchical clustering. The resulting polynomial reflects how the graph combines with the hierarchy of clusters and can be used to assess the quality of a hierarchical clustering. We discuss archetypal examples as proof-of-concept. We also look at how this multilevel modularity acts on a popular real world example.
Fichier principal
Vignette du fichier
Delest_sub_KDD2011.pdf (847.08 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-00579474 , version 1 (24-03-2011)
hal-00579474 , version 2 (29-07-2011)

Identifiants

  • HAL Id : hal-00579474 , version 1

Citer

Maylis Delest, Guy Melançon, François Queyroi, Jean-Marc Fédou. Assessing the Quality of Multilevel Graph Clustering. 2011. ⟨hal-00579474v1⟩
526 Consultations
480 Téléchargements

Partager

Gmail Facebook X LinkedIn More