Improving the efficiency and efficacy of the K-means clustering algorithm through a new convergence condition

O. Joaquín Pérez; R. Rodolfo Pazos; R. Laura Cruz; S. Gerardo Reyes; T. Rosy Basave; H. Héctor Fraire

Conference Proceedings

Improving the efficiency and efficacy of the K-means clustering algorithm through a new convergence condition

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2007) 4707 LNCS(PART 3) 674-682

DOI: 10.1007/978-3-540-74484-9_58

11Citations

12Readers

Get full text

Abstract

Clustering problems arise in many different applications: machine learning, data mining, knowledge discovery, data compression, vector quantization, pattern recognition and pattern classification. One of the most popular and widely studied clustering methods is K-means. Several improvements to the standard K-means algorithm have been carried out, most of them related to the initial parameter values. In contrast, this article proposes an improvement using a new convergence condition that consists of stopping the execution when a local optimum is found or no more object exchanges among groups can be performed. For assessing the improvement attained, the modified algorithm (Early Stop K-means) was tested on six databases of the UCI repository, and the results were compared against SPSS, Weka and the standard K-means algorithm. Experimentally Early Stop K-means obtained important reductions in the number of iterations and improvements in the solution quality with respect to the other algorithms. © Springer-Verlag Berlin Heidelberg 2007.

Cite

CITATION STYLE

APA

Joaquín Pérez, O., Rodolfo Pazos, R., Laura Cruz, R., Gerardo Reyes, S., Rosy Basave, T., & Héctor Fraire, H. (2007). Improving the efficiency and efficacy of the K-means clustering algorithm through a new convergence condition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4707 LNCS, pp. 674–682). https://doi.org/10.1007/978-3-540-74484-9_58

Improving the efficiency and efficacy of the K-means clustering algorithm through a new convergence condition

Abstract

Cite

Register to see more suggestions