Abstract
This chapter presents HPStream, a new framework for projected clustering of high-dimensional data streams. It finds projected clusters in specific subsets of the dimensions by maintaining condensed representations of the clusters over time. The algorithm produces higher-quality clusters than full-dimensional data stream clustering algorithms. The chapter evaluates the algorithm on a number of real and synthetic data sets; in each case, HPStream is found to be more effective than the full-dimensional CluStream algorithm. High-dimensional projected clustering of data streams opens a new direction for stream data mining. With this methodology, projected clustering can be treated as a preprocessing step that may lead to more effective methods for stream classification, similarity, evolution, and outlier analysis.
Cite
Aggarwal, C. C., Yu, P. S., Han, J., & Wang, J. (2004). A Framework for Projected Clustering of High Dimensional Data Streams. In Proceedings 2004 VLDB Conference: The 30th International Conference on Very Large Databases (VLDB) (pp. 852–863). Elsevier. https://doi.org/10.1016/B978-012088469-8.50075-9