Parallel community detection for massive graphs

E. Jason Riedy; Henning Meyerhenke; David Ediger; David A. Bader

Conference Proceedings

Parallel community detection for massive graphs

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7203 LNCS(PART 1) 286-296

DOI: 10.1007/978-3-642-31464-3_29

43Citations

66Readers

Get full text

Abstract

Tackling the current volume of graph-structured data requires parallel tools. We extend our work on analyzing such massive graph data with the first massively parallel algorithm for community detection that scales to current data sizes, scaling to graphs of over 122 million vertices and nearly 2 billion edges in under 7300 seconds on a massively multithreaded Cray XMT. Our algorithm achieves moderate parallel scalability without sacrificing sequential operational complexity. Community detection partitions a graph into subgraphs more densely connected within the subgraph than to the rest of the graph. We take an agglomerative approach similar to Clauset, Newman, and Moore's sequential algorithm, merging pairs of connected intermediate subgraphs to optimize different graph properties. Working in parallel opens new approaches to high performance. On smaller data sets, we find the output's modularity compares well with the standard sequential algorithms. © 2012 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Riedy, E. J., Meyerhenke, H., Ediger, D., & Bader, D. A. (2012). Parallel community detection for massive graphs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7203 LNCS, pp. 286–296). https://doi.org/10.1007/978-3-642-31464-3_29

Parallel community detection for massive graphs

Abstract

Author supplied keywords

Cite

Register to see more suggestions