Impact of distance measures on the performance of clustering algorithms

Vijay Kumar; Jitender Kumar Chhabra; Dinesh Kumar

Conference Proceedings

Impact of distance measures on the performance of clustering algorithms

Advances in Intelligent Systems and Computing (2014) 243 183-190

DOI: 10.1007/978-81-322-1665-0_17

12Citations

27Readers

Get full text

Abstract

Distance measure plays a vital role in clustering algorithms. Selecting the right distance measure for a given dataset is a challenging problem. In this paper, the effect of six distance measures on three clustering algorithms, K-means, single linkage, and average linkage is investigated. The distance measures include Euclidean, Euclidean squared, Manhattan, Mahalanobis, cosine similarity, and Pearson correlation. We describe all the distance measures pointing out their strengths and weaknesses. The performance of clustering algorithms on distance measures are evaluated on two artificial and four real-life datasets. Experimental results show the impact of distance measures when used for different datasets.

Author supplied keywords

Cite

CITATION STYLE

APA

Kumar, V., Chhabra, J. K., & Kumar, D. (2014). Impact of distance measures on the performance of clustering algorithms. In Advances in Intelligent Systems and Computing (Vol. 243, pp. 183–190). Springer Verlag. https://doi.org/10.1007/978-81-322-1665-0_17

Impact of distance measures on the performance of clustering algorithms

Abstract

Author supplied keywords

Cite

Register to see more suggestions