We present two new density-based algorithms for clustering data points in lower dimensions (dimensions ≤ 10). Both our algorithms compute density-based clusters and noises in O(n) CPU time, space, and I/O cost, under some reasonable assumptions, where n is the number of input points. Besides packing the data structure into buckets and using block access techniques to reduce the I/O cost, our algorithms apply space-filling curve techniques to reduce the disk access operations. Our first algorithm (Algorithm A) focuses on handling not highly clustered input data, while the second algorithm (Algorithm B) focuses on highly clustered input data. We implemented our algorithms, evaluated the effects of various space-filling curves, identified the best space-filling curve for our approaches, and conducted extensive performance evaluation. The experiments show the high performance of our algorithms. We believe that our algorithms are of considerable practical value. © Springer-Verlag Berlin Heidelberg 2007.
CITATION STYLE
Xu, B., & Chen, D. Z. (2007). Density-based data clustering algorithms for lower dimensions using space-filling curves. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4426 LNAI, pp. 997–1005). Springer Verlag. https://doi.org/10.1007/978-3-540-71701-0_112
Mendeley helps you to discover research relevant for your work.