Partition-based clustering with sliding windows for data streams

Jonghem Youn; Jihun Choi; Junho Shim; Sang Goo Lee

Conference Proceedings

Partition-based clustering with sliding windows for data streams

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10178 LNCS 289-303

DOI: 10.1007/978-3-319-55699-4_18

2Citations

5Readers

Get full text

Abstract

Data stream clustering with sliding windows generates clusters for every window movement. Because repeated clustering on all changed windows is highly inefficient in terms of memory and computation time, a clustering algorithm should be designed with considering only inserted and deleted tuples of windows. In this paper, we address this problem by sliding window aggregation technique and cluster modification strategy. We propose a novel data structure for construction and maintenance of 2-level synopses. This data structure enables to update synopses efficiently and support precise sliding window operations. We also suggest a modification strategy to decide whether to append new synopses to pre-existing clusters or perform clustering on whole synopses according to the difference between probability distributions of the original and updated clusters. Experimental results show that proposed method outperforms state-of-the-art methods.

Author supplied keywords

Cite

CITATION STYLE

APA

Youn, J., Choi, J., Shim, J., & Lee, S. G. (2017). Partition-based clustering with sliding windows for data streams. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10178 LNCS, pp. 289–303). Springer Verlag. https://doi.org/10.1007/978-3-319-55699-4_18

Partition-based clustering with sliding windows for data streams

Abstract

Author supplied keywords

Cite

Register to see more suggestions