Distance measure plays a vital role in clustering algorithms. Selecting the right distance measure for a given dataset is a challenging problem. In this paper, the effect of six distance measures on three clustering algorithms, K-means, single linkage, and average linkage is investigated. The distance measures include Euclidean, Euclidean squared, Manhattan, Mahalanobis, cosine similarity, and Pearson correlation. We describe all the distance measures pointing out their strengths and weaknesses. The performance of clustering algorithms on distance measures are evaluated on two artificial and four real-life datasets. Experimental results show the impact of distance measures when used for different datasets.
CITATION STYLE
Kumar, V., Chhabra, J. K., & Kumar, D. (2014). Impact of distance measures on the performance of clustering algorithms. In Advances in Intelligent Systems and Computing (Vol. 243, pp. 183–190). Springer Verlag. https://doi.org/10.1007/978-81-322-1665-0_17
Mendeley helps you to discover research relevant for your work.