Clustering problems arise in many different applications: machine learning, data mining, knowledge discovery, data compression, vector quantization, pattern recognition and pattern classification. One of the most popular and widely studied clustering methods is K-means. Several improvements to the standard K-means algorithm have been carried out, most of them related to the initial parameter values. In contrast, this article proposes an improvement using a new convergence condition that consists of stopping the execution when a local optimum is found or no more object exchanges among groups can be performed. For assessing the improvement attained, the modified algorithm (Early Stop K-means) was tested on six databases of the UCI repository, and the results were compared against SPSS, Weka and the standard K-means algorithm. Experimentally Early Stop K-means obtained important reductions in the number of iterations and improvements in the solution quality with respect to the other algorithms. © Springer-Verlag Berlin Heidelberg 2007.
CITATION STYLE
Joaquín Pérez, O., Rodolfo Pazos, R., Laura Cruz, R., Gerardo Reyes, S., Rosy Basave, T., & Héctor Fraire, H. (2007). Improving the efficiency and efficacy of the K-means clustering algorithm through a new convergence condition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4707 LNCS, pp. 674–682). https://doi.org/10.1007/978-3-540-74484-9_58
Mendeley helps you to discover research relevant for your work.