Assessing the Quality of Multilevel Graph Clustering

Abstract : Hierarchical clustering of graphs is a useful strategy to mine, explore and visualize graphs. Popular approaches define ad hoc procedures to decide how subgraphs are subdivided or nested. The popularity of graph hierarchies certainly relates to the relevance of multilevel models appearing in the natural and social sciences. For instance, current models in biology (genomics and/or proteomics) try to capture the multilevel nature of networks formed by various biological entities; cities and worldwide city systems in geography can also be described as multilevel networks. In our opinion, a theory supporting these multilevel clustering approaches is yet to be developed. Indeed, to the best of our knowledge there are no known optimization multilevel criteria guiding the construction of a hierarchy of clusters: the hierarchy basically is an artefact of an iterative procedure. The main results of this paper contribute to such a multilevel clustering theory, by designing and studying a multilevel modularity measure for hierarchically clustered graphs, explicitly taking the nesting structure of clusters into account. The multilevel modularity we propose generalizes a modularity measure introduced by Mancoridis et al. in the context of reverse software engineering. The measure we designed recursively traverses the hierarchy of clusters and computes a one-variable polynomial encoding the intra and inter-cluster densities appearing at all levels in a hierarchical clustering. The resulting polynomial reflects how the graph combines with the hierarchy of clusters and can be used to assess the quality of a hierarchical clustering. We discuss archetypal examples as proof-of-concept. We also look at how this multilevel modularity acts on a popular real world example.
Type de document :
Article dans une revue
Data Mining and Knowledge Discovery, Springer Verlag, 2014, 28 (4), pp.1107-1128. <10.1007/s10618-013-0335-9>
Liste complète des métadonnées


https://hal.archives-ouvertes.fr/hal-00579474
Contributeur : François Queyroi <>
Soumis le : vendredi 29 juillet 2011 - 09:40:01
Dernière modification le : mercredi 2 décembre 2015 - 01:08:29
Document(s) archivé(s) le : dimanche 4 décembre 2016 - 19:51:06

Fichier

Delest2011Mqq.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Citation

François Queyroi, Maylis Delest, Jean-Marc Fédou, Guy Melançon. Assessing the Quality of Multilevel Graph Clustering. Data Mining and Knowledge Discovery, Springer Verlag, 2014, 28 (4), pp.1107-1128. <10.1007/s10618-013-0335-9>. <hal-00579474v2>

Partager

Métriques

Consultations de
la notice

517

Téléchargements du document

366