CLINCH: Clustering incomplete high-dimensional data for data mining application

Zunping Cheng; Ding Zhou; Chen Wang; Jiankui Guo; Wei Wang; Baokang Ding; Baue Shi

Conference Proceedings

CLINCH: Clustering incomplete high-dimensional data for data mining application

Lecture Notes in Computer Science (2005) 3399 88-99

DOI: 10.1007/978-3-540-31849-1_10

2Citations

12Readers

Get full text

Abstract

Clustering is a common technique in data mining to discover hidden patterns from massive datasets. With the development of privacy-maintaining data mining application, clustering incomplete high-dimensional data has becoming more and more useful. Motivated by these limits, we develop a novel algorithm CLINCH, which could produce fine clusters on incomplete high-dimensional data space. To handle missing attributes, CLINCH employs a prediction method that can be more precise than traditional techniques. On the other hand, we also introduce an efficient way in which dimensions are processed one by one to attack the "curse of dimensionality". Experiments show that our algorithm not only outperforms many existing high-dimensional clustering algorithms in scalability and efficiency, but also produces precise results. © Springer-Verlag Berlin Heidelberg 2005.

Author supplied keywords

Cite

CITATION STYLE

APA

Cheng, Z., Zhou, D., Wang, C., Guo, J., Wang, W., Ding, B., & Shi, B. (2005). CLINCH: Clustering incomplete high-dimensional data for data mining application. In Lecture Notes in Computer Science (Vol. 3399, pp. 88–99). Springer Verlag. https://doi.org/10.1007/978-3-540-31849-1_10

CLINCH: Clustering incomplete high-dimensional data for data mining application

Abstract

Author supplied keywords

Cite

Register to see more suggestions