StreamLeader: A new stream clustering algorithm not based in conventional clustering

Jaime Andrés-Merino; Lluís A. Belanche

Conference Proceedings

StreamLeader: A new stream clustering algorithm not based in conventional clustering

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9887 LNCS 208-215

DOI: 10.1007/978-3-319-44781-0_25

0Citations

15Readers

Get full text

Abstract

Stream clustering algorithms normally require two phases: an online first step that statistically summarizes the stream while forming special structures – such as micro-clusters– and a second, offline phase, that uses a conventional clustering algorithm taking the microclusters as pseudo-points to deliver the final clustering. This procedure tends to produce oversized or overlapping clusters in medium-to-high dimensional spaces, and typically degrades seriously in noisy data environments. In this paper we introduce StreamLeader, a novel stream clustering algorithm suitable to massive data that does not resort to a conventional clustering phase, being based on the notion of Leader Cluster and on an aggressive noise reduction process. We report an extensive systematic testing in which the new algorithm is shown to consistently outperform its contenders both in terms of quality and scalability.

Author supplied keywords

Cite

CITATION STYLE

APA

Andrés-Merino, J., & Belanche, L. A. (2016). StreamLeader: A new stream clustering algorithm not based in conventional clustering. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9887 LNCS, pp. 208–215). Springer Verlag. https://doi.org/10.1007/978-3-319-44781-0_25

StreamLeader: A new stream clustering algorithm not based in conventional clustering

Abstract

Author supplied keywords

Cite

Register to see more suggestions