Recovering asymmetric communities in the stochastic block model

Francesco Caltagirone (1), Marc Lelarge (2, 1), Léo Miolane (1)
(1) DYOGENE - Dynamics of Geometric Networks: DI-ENS - Département d'informatique de l'École normale supérieure, CNRS - Centre National de la Recherche Scientifique : UMR 8548, Inria de Paris
Abstract: We consider the sparse stochastic block model in the case where the degrees are uninformative. The case where the two communities have approximately the same size has been extensively studied, and we concentrate here on the community detection problem in the case of unbalanced communities. In this setting, spectral algorithms based on the non-backtracking matrix are known to solve the community detection problem (i.e. do strictly better than a random guess) when the signal is sufficiently large, namely above the so-called Kesten-Stigum threshold. In this regime, and when the average degree tends to infinity, we show that if the community of a vanishing fraction of the vertices is revealed, then a local algorithm (belief propagation) is optimal down to the Kesten-Stigum threshold, and we quantify its performance explicitly. Below the Kesten-Stigum threshold, we show that, in the large degree limit, there is a second threshold, called the spinodal curve, below which the community detection problem is not solvable. The spinodal curve is equal to the Kesten-Stigum threshold when the fraction of vertices in the smallest community is above $p^* = \frac{1}{2} - \frac{1}{2\sqrt{3}}$, so that the Kesten-Stigum threshold is the threshold for solvability of community detection in this case. However, when the smallest community is smaller than $p^*$, the spinodal curve only provides a lower bound on the threshold for solvability. In the regime below the Kesten-Stigum bound and above the spinodal curve, we also characterize the performance of the best local algorithms as a function of the fraction of revealed vertices. Our proof relies on a careful analysis of the associated reconstruction problem on trees, which might be of independent interest. In particular, we show that the spinodal curve corresponds to the reconstruction threshold on the tree.
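As a quick numerical illustration of the constant quoted in the abstract, the following minimal sketch evaluates $p^* = \frac{1}{2} - \frac{1}{2\sqrt{3}}$ and checks on which side of it a given smallest-community fraction falls. The helper name `ks_is_solvability_threshold` and its argument `p_small` are hypothetical, introduced here only for illustration; the sketch encodes nothing beyond the comparison with $p^*$ stated in the abstract.

```python
import math

# p* as stated in the abstract: 1/2 - 1/(2*sqrt(3)).
P_STAR = 0.5 - 1.0 / (2.0 * math.sqrt(3.0))


def ks_is_solvability_threshold(p_small: float) -> bool:
    """Hypothetical helper (not from the paper).

    Returns True when the fraction of vertices in the smallest community
    is above p*, the case in which the abstract states that the
    Kesten-Stigum threshold is the threshold for solvability.
    Returns False when the fraction is at or below p*, where the abstract
    only gives the spinodal curve as a lower bound.
    """
    if not 0.0 < p_small <= 0.5:
        raise ValueError("the smallest of two communities has fraction in (0, 1/2]")
    return p_small > P_STAR


if __name__ == "__main__":
    print(round(P_STAR, 4))                   # approximately 0.2113
    print(ks_is_solvability_threshold(0.30))  # fraction above p*
    print(ks_is_solvability_threshold(0.10))  # fraction below p*
```

Numerically, $p^* \approx 0.2113$, so the case distinction in the abstract occurs when the smallest community holds roughly 21% of the vertices.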
Document type:
Conference paper
Allerton 2016: 54th Annual Allerton Conference on Communication, Control, and Computing, Sep 2016, Monticello, United States

Cited literature [16 references]

https://hal.archives-ouvertes.fr/hal-01391609
Contributor: Marc Lelarge
Submitted on: Thursday, November 3, 2016 - 15:36:56
Last modified on: Saturday, December 1, 2018 - 01:24:26
Document(s) archived on: Saturday, February 4, 2017 - 13:52:22

File

1610.03680v1.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: hal-01391609, version 1

Citation

Francesco Caltagirone, Marc Lelarge, Léo Miolane. Recovering asymmetric communities in the stochastic block model. allerton 2016 54th Annual Allerton Conference on Communication, Control, and Computing, Sep 2016, Monticello, United States. 〈hal-01391609〉
