Abstract
Hypergraphs can capture higher-order relations between subsets of objects instead of only pairwise relations as in graphs. Hypergraph clustering is an important task in information retrieval and machine learning. We study the problem of distributed hypergraph clustering in the message passing communication model using small communication cost. We propose an algorithm framework for distributed hypergraph clustering based on spectral hypergraph sparsification. For an n-vertex hypergraph G with hyperedges of maximum size r distributed at s sites arbitrarily and a parameter g (0,1), our algorithm can produce a vertex set with conductance O(g1+gG), where G is the conductance of G, using communication cost O(nr2s/O(1)) (O hides a polylogarithmic factor). The theoretical results are complemented with extensive experiments to demonstrate the efficiency and effectiveness of the proposed algorithm under different real-world datasets. Our source code is publicly available at github.com/chunjiangzhu/dhgc.
Author supplied keywords
Cite
CITATION STYLE
Zhu, C. J., Liu, Q., & Bi, J. (2021). Communication Efficient Distributed Hypergraph Clustering. In SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 2131–2135). Association for Computing Machinery, Inc. https://doi.org/10.1145/3404835.3463092
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.