Hierarchical clustering for real-time stream data with noise

Philipp Kranen; Felix Reidl; Fernando Sanchez Villaamil; Thomas Seidl

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011), vol. 6809 LNCS, pp. 405–413

DOI: 10.1007/978-3-642-22351-8_25

Abstract

In stream data mining, stream clustering algorithms provide summaries of the relevant data objects that arrived in the stream. The model size of the clustering, i.e., its granularity, is usually determined by the speed (data per time) of the data stream. For varying streams, e.g., with daytime or seasonal changes in the amount of data, most algorithms have to heavily restrict their model size so that they can handle the minimal time allowance. Recently, the first anytime stream clustering algorithm was proposed, which flexibly uses all available time and dynamically adapts its model size. However, that method has several drawbacks: it performs no noise detection, since every point is treated equally, and new concepts can only emerge within existing ones. In this paper we propose the LiarTree algorithm, which is capable of anytime clustering and at the same time robust against noise and novelty, so that it can deal with arbitrary data streams. © 2011 Springer-Verlag Berlin Heidelberg.
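
The anytime idea in the abstract can be illustrated with a small sketch. This is not the LiarTree algorithm from the paper; it is a simplified illustration under stated assumptions. The names `Node`, `insert`, `budget`, and `noise_factor` are hypothetical, a step budget stands in for the available time, and the distance-based noise test is an assumed placeholder for whatever criterion the paper actually uses.

```python
from __future__ import annotations

import math
from dataclasses import dataclass, field


@dataclass
class Node:
    """One cluster summary in a hierarchy (hypothetical structure)."""
    center: tuple
    radius: float
    count: int = 0
    children: list = field(default_factory=list)
    parked: list = field(default_factory=list)  # points left here when time ran out


def insert(root: Node, point: tuple, budget: int, noise: list,
           noise_factor: float = 2.0) -> str:
    """Descend the hierarchy for at most `budget` levels.

    `budget` is a deterministic stand-in for the time available before the
    next point arrives. More budget means a deeper descent, i.e. a finer
    model; less budget leaves the point at a coarser level.
    """
    node = root
    steps = 0
    while node.children:
        if steps >= budget:
            # Interrupted: park the point at the current (coarser) level.
            node.parked.append(point)
            return "parked"
        nearest = min(node.children, key=lambda c: math.dist(c.center, point))
        if math.dist(nearest.center, point) > noise_factor * nearest.radius:
            # Assumed noise test: too far from every child at this level.
            noise.append(point)
            return "noise"
        nearest.count += 1
        node = nearest
        steps += 1
    return "leaf"
```

With a large budget a point reaches a leaf, with a small budget it is parked at an inner node, and a point far from all clusters goes to the separate noise list instead of distorting an existing cluster.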

Cite

Kranen, P., Reidl, F., Sanchez Villaamil, F., & Seidl, T. (2011). Hierarchical clustering for real-time stream data with noise. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6809 LNCS, pp. 405–413). https://doi.org/10.1007/978-3-642-22351-8_25
