Dynamic incremental data summarization for hierarchical clustering

Bing Liu; Yuliang Shi; Zhihui Wang; Wei Wang; Baile Shi

Conference Proceedings

Dynamic incremental data summarization for hierarchical clustering

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4016 LNCS 410-421

DOI: 10.1007/11775300_35

1Citations

7Readers

Get full text

Abstract

In many real world applications, with the databases frequent insertions and deletions, the ability of a data mining technique to detect and react quickly to dynamic changes in the data distribution and clustering over time is highly desired. Data summarizations (e.g., data bubbles) have been proposed to compress large databases into representative points suitable for subsequent hierarchical cluster analysis. In this paper, we thoroughly investigate the quality measure (data summarization index) of incremental data bubbles. When updating databases, we show which factors could affect the mean and standard deviation of data summarization index or not. Based on these statements, a fully dynamic scheme to maintain data bubbles incrementally is proposed. An extensive experimental evaluation confirms our statements and shows that the fully dynamic incremental data bubbles are effective in preserving the quality of the data summarization for hierarchical clustering. © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Liu, B., Shi, Y., Wang, Z., Wang, W., & Shi, B. (2006). Dynamic incremental data summarization for hierarchical clustering. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4016 LNCS, pp. 410–421). Springer Verlag. https://doi.org/10.1007/11775300_35

Dynamic incremental data summarization for hierarchical clustering

Abstract

Cite

Register to see more suggestions