Clustering plays a significant role in identifying the intrinsic structure of data. In this paper, various clustering algorithms are compared on real, numerical, categorical datasets around the cluster size. From the analysis, it is inferred that the repetition of KMeans many times does not bring better significant iterations since it starts randomly. It purely depends on the initial choice of the centroid of clusters. The sum of squared error decreases with increasing cluster size. The Expectation–Maximization (EM) is time-consuming than KMeans.
CITATION STYLE
Karthika, N., & Janet, B. (2020). Clustering Performance Analysis. In Advances in Intelligent Systems and Computing (Vol. 1082, pp. 25–39). Springer. https://doi.org/10.1007/978-981-15-1081-6_3
Mendeley helps you to discover research relevant for your work.