Spectral clustering algorithm has been shown to be more effective in finding clusters than most traditional algorithms. However, spectral clustering suffers from a scalability problem in both memory use and computational time when a dataset size is large. To perform clustering on large datasets, we propose to parallelize both memory use and computation on distributed computers. Through an empirical study on a large document dataset of 193,844 data instances and a large photo dataset of 637,137, we demonstrate that our parallel algorithm can effectively alleviate the scalability problem. © 2008 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Song, Y., Chen, W. Y., Bai, H., Lin, C. J., & Chang, E. Y. (2008). Parallel spectral clustering. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5212 LNAI, pp. 374–389). https://doi.org/10.1007/978-3-540-87481-2_25
Mendeley helps you to discover research relevant for your work.