This paper initiates formal analysis of a simple, distributed algorithm for community detection on networks. We analyze an algorithm that we call Max-LPA, both in terms of its convergence time and in terms of the "quality" of the communities detected. Max-LPA is an instance of a class of community detection algorithms called label propagation algorithms. As far as we know, most analysis of label propagation algorithms thus far has been empirical in nature; in this paper we seek a theoretical understanding of these algorithms. In our main result, we define a clustered version of Erdős-Rényi random graphs with clusters $V_1, V_2, \ldots, V_k$, where the probability $p$ of an edge connecting nodes within a cluster $V_i$ is higher than $p'$, the probability of an edge connecting nodes in distinct clusters. We show that even with fairly general restrictions on $p$ and $p'$ ($p = \Omega(1/n^{1/4-\epsilon})$ for any $\epsilon > 0$, $p' = O(p^2)$, where $n$ is the number of nodes), Max-LPA detects the clusters $V_1, V_2, \ldots, V_k$ in just two rounds. Based on this and on empirical results, we conjecture that Max-LPA can correctly and quickly identify communities on clustered Erdős-Rényi graphs even when the clusters are much sparser, i.e., with $p = c \log n / n$ for some $c > 1$. © Springer-Verlag 2013.
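The setting described above can be illustrated with a minimal Python sketch. The abstract does not spell out the update rule, so the version below rests on assumptions: every node starts with a unique label, all nodes update synchronously, each node adopts the most frequent label in its closed neighborhood (itself plus its neighbors), and ties are broken in favor of the largest label. The function names `clustered_gnp`, `max_lpa_round`, and `max_lpa` are hypothetical and chosen only for this illustration.

```python
import random
from collections import Counter


def clustered_gnp(sizes, p_in, p_out, seed=None):
    """Sample a clustered Erdos-Renyi graph (illustrative helper).

    Nodes are numbered 0..n-1; cluster i holds sizes[i] consecutive nodes.
    Pairs inside a cluster are joined with probability p_in (p in the
    abstract), pairs in distinct clusters with probability p_out (p').
    Returns (adjacency sets, cluster index of each node).
    """
    rng = random.Random(seed)
    cluster_of = [i for i, size in enumerate(sizes) for _ in range(size)]
    n = len(cluster_of)
    adj = [set() for _ in range(n)]
    for u in range(n):
        for v in range(u + 1, n):
            prob = p_in if cluster_of[u] == cluster_of[v] else p_out
            if rng.random() < prob:
                adj[u].add(v)
                adj[v].add(u)
    return adj, cluster_of


def max_lpa_round(adj, labels):
    """One synchronous round under the assumed rule (see lead-in)."""
    new_labels = []
    for u, neighbors in enumerate(adj):
        counts = Counter(labels[v] for v in neighbors)
        counts[labels[u]] += 1  # closed neighborhood: count the node itself
        best = max(counts.values())
        # Assumed tie-break: the largest label among the most frequent ones.
        new_labels.append(max(l for l, c in counts.items() if c == best))
    return new_labels


def max_lpa(adj, rounds=2):
    """Run the assumed update for a fixed number of rounds from unique labels."""
    labels = list(range(len(adj)))
    for _ in range(rounds):
        labels = max_lpa_round(adj, labels)
    return labels


# Degenerate sanity check: p = 1 and p' = 0 give three disjoint 4-cliques,
# so each clique collapses to its largest initial label.
adj, cluster_of = clustered_gnp([4, 4, 4], p_in=1.0, p_out=0.0)
print(max_lpa(adj))  # [3, 3, 3, 3, 7, 7, 7, 7, 11, 11, 11, 11]
```

For non-degenerate values of `p_in` and `p_out` the output depends on the random sample, and small graphs need not satisfy the asymptotic conditions stated in the abstract, so this sketch should not be read as a check of the two-round result.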
CITATION STYLE
Kothapalli, K., Pemmaraju, S. V., & Sardeshmukh, V. (2013). On the analysis of a label propagation algorithm for community detection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7730 LNCS, pp. 255–269). https://doi.org/10.1007/978-3-642-35668-1_18