Big data clustering: A review

Ali Seyed Shirkhorshidi; Saeed Aghabozorgi; Teh Ying Wah; Tutut Herawan

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2014) 8583 LNCS(PART 5) 707-720

DOI: 10.1007/978-3-319-09156-3_49

194 Citations

466 Readers

Abstract

Clustering is an essential data mining tool for analyzing big data. Applying clustering techniques to big data is difficult because of the new challenges that big data raises. Because big data refers to terabytes and petabytes of data, and clustering algorithms come with high computational costs, the question is how to cope with this problem and how to deploy clustering techniques on big data and obtain results in a reasonable time. This study aims to review the trend and progress of clustering algorithms in coping with big data challenges, from the very first proposed algorithms to today's novel solutions. The algorithms and the challenges they target in producing improved clustering algorithms are introduced and analyzed, and afterward the possible future path toward more advanced algorithms is illuminated based on today's available technologies and frameworks. © 2014 Springer International Publishing.

Cite

APA

Shirkhorshidi, A. S., Aghabozorgi, S., Wah, T. Y., & Herawan, T. (2014). Big data clustering: A review. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8583 LNCS, pp. 707–720). Springer Verlag. https://doi.org/10.1007/978-3-319-09156-3_49
