Comparing fuzzy-C means and K-means clustering techniques: A comprehensive study

71Citations
Citations of this article
65Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Clustering techniques are unsupervised learning methods of grouping similar from dissimilar data types. Therefore, these are popular for various data mining and pattern recognition purposes. However, their performances are data dependent. Thus, choosing right clustering technique for a given dataset is a research challenge. In this paper, we have tested the performances of a Soft clustering (e.g., Fuzzy C means or FCM) and a Hard clustering technique (e.g., K-means or KM) on Iris (150 x 4); Wine (178 x 13) and Lens (24 x 4) datasets. Distance measure is the heart of any clustering algorithm to compute the similarity between any two data. Two distance measures such as Manhattan (MH) and Euclidean (ED) are used to note how these influence the overall clustering performance. The performance has been compared based on seven parameters: (i) sensitivity, (ii) specificity, (iii) precision, (iv) accuracy, (v) run time, (vi) average intra cluster distance (i.e. compactness of the clusters) and (vii) inter cluster distance (i.e. distinctiveness of the clusters). Based on the experimental results, the paper concludes that both KM and FCM have performed well. However, KM outperforms FCM in terms of speed. FCM-MH combination produces most compact clusters, while KM-ED yields most distinct clusters. © 2012 Springer-Verlag GmbH.

Cite

CITATION STYLE

APA

Panda, S., Sahu, S., Jena, P., & Chattopadhyay, S. (2012). Comparing fuzzy-C means and K-means clustering techniques: A comprehensive study. In Advances in Intelligent and Soft Computing (Vol. 166 AISC, pp. 451–460). https://doi.org/10.1007/978-3-642-30157-5_45

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free