AGRID: An efficient algorithm for clustering large high-dimensional datasets

Yanchang Zhao; Junde Song

Conference Proceedings

AGRID: An efficient algorithm for clustering large high-dimensional datasets

Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (2003) 2637 271-282

DOI: 10.1007/3-540-36175-8_27

10Citations

4Readers

Get full text

Abstract

The clustering algorithm GDILC relies on density-based clustering with grid and is designed to discover clusters of arbitrary shapes and eliminate noises. However, it is not scalable to large high-dimensional datasets. In this paper, we improved this algorithm in five important directions. Through these improvements, AGRID is of high scalability and can process large high-dimensional datasets. It can discover clusters of various shapes and eliminate noises effectively. Besides, it is insensitive to the order of input and is a non-parametric algorithm. The high speed and accuracy of the AGRID clustering algorithm was shown in our experiments.

Author supplied keywords

Cite

CITATION STYLE

APA

Zhao, Y., & Song, J. (2003). AGRID: An efficient algorithm for clustering large high-dimensional datasets. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 2637, pp. 271–282). Springer Verlag. https://doi.org/10.1007/3-540-36175-8_27

AGRID: An efficient algorithm for clustering large high-dimensional datasets

Abstract

Author supplied keywords

Cite

Register to see more suggestions