Clustering is a common technique in data mining for discovering hidden patterns in massive datasets. With the development of privacy-preserving data mining applications, clustering incomplete high-dimensional data has become increasingly useful. Motivated by this need, we develop a novel algorithm, CLINCH, which can produce fine clusters in an incomplete high-dimensional data space. To handle missing attributes, CLINCH employs a prediction method that can be more precise than traditional techniques. We also introduce an efficient scheme in which dimensions are processed one by one to attack the "curse of dimensionality". Experiments show that our algorithm not only outperforms many existing high-dimensional clustering algorithms in scalability and efficiency, but also produces precise results. © Springer-Verlag Berlin Heidelberg 2005.
CITATION STYLE
Cheng, Z., Zhou, D., Wang, C., Guo, J., Wang, W., Ding, B., & Shi, B. (2005). CLINCH: Clustering incomplete high-dimensional data for data mining application. In Lecture Notes in Computer Science (Vol. 3399, pp. 88–99). Springer Verlag. https://doi.org/10.1007/978-3-540-31849-1_10