Parallel spectral clustering

Yangqiu Song; Wen Yen Chen; Hongjie Bai; Chih Jen Lin; Edward Y. Chang

Conference ProceedingsOPEN ACCESS

Parallel spectral clustering

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2008) 5212 LNAI(PART 2) 374-389

DOI: 10.1007/978-3-540-87481-2_25

57Citations

96Readers

Abstract

Spectral clustering algorithm has been shown to be more effective in finding clusters than most traditional algorithms. However, spectral clustering suffers from a scalability problem in both memory use and computational time when a dataset size is large. To perform clustering on large datasets, we propose to parallelize both memory use and computation on distributed computers. Through an empirical study on a large document dataset of 193,844 data instances and a large photo dataset of 637,137, we demonstrate that our parallel algorithm can effectively alleviate the scalability problem. © 2008 Springer-Verlag Berlin Heidelberg.

Author supplied keywords

Cite

CITATION STYLE

APA

Song, Y., Chen, W. Y., Bai, H., Lin, C. J., & Chang, E. Y. (2008). Parallel spectral clustering. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5212 LNAI, pp. 374–389). https://doi.org/10.1007/978-3-540-87481-2_25

Parallel spectral clustering

Abstract

Author supplied keywords

Cite

Register to see more suggestions