CLINCH: Clustering incomplete high-dimensional data for data mining application

Abstract

Clustering is a common data mining technique for discovering hidden patterns in massive datasets. With the development of privacy-maintaining data mining applications, clustering incomplete high-dimensional data has become increasingly useful. Motivated by this need, we develop a novel algorithm, CLINCH, which can produce fine clusters in an incomplete high-dimensional data space. To handle missing attributes, CLINCH employs a prediction method that can be more precise than traditional techniques. We also introduce an efficient scheme in which dimensions are processed one by one to attack the "curse of dimensionality". Experiments show that our algorithm not only outperforms many existing high-dimensional clustering algorithms in scalability and efficiency, but also produces precise results. © Springer-Verlag Berlin Heidelberg 2005.

Cite

Cheng, Z., Zhou, D., Wang, C., Guo, J., Wang, W., Ding, B., & Shi, B. (2005). CLINCH: Clustering incomplete high-dimensional data for data mining application. In Lecture Notes in Computer Science (Vol. 3399, pp. 88–99). Springer Verlag. https://doi.org/10.1007/978-3-540-31849-1_10
