How the initialization affects the stability of the k-means algorithm

Abstract

We investigate the role of initialization in the stability of the k-means clustering algorithm. Unlike other papers, we consider the actual k-means algorithm (also known as Lloyd's algorithm). In particular, we exploit the property that this algorithm can get stuck in local optima of the k-means objective function. We are interested in the actual clustering, not only in the cost of the solution. We analyze when different initializations lead to the same local optimum and when they lead to different local optima. This enables us to prove that it is reasonable to select the number of clusters based on stability scores. © EDP Sciences, SMAI 2012.
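
To make the local-optimum behaviour concrete, here is a minimal sketch of Lloyd's iterations run from two different initializations on the same data. It is not taken from the paper: the toy data set, the function names `lloyd` and `pair_agreement`, and the use of pairwise agreement (the Rand index) as a stand-in for a stability score are all assumptions made for illustration.

```python
import numpy as np

def lloyd(X, centers, max_iter=100):
    """Plain Lloyd iterations from the given initial centers (no restarts)."""
    centers = np.asarray(centers, dtype=float).copy()
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest center.
        dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        new_labels = dists.argmin(axis=1)
        if labels is not None and np.array_equal(new_labels, labels):
            break  # assignments stopped changing: a local optimum
        labels = new_labels
        # Update step: each center moves to the mean of its points.
        for j in range(len(centers)):
            members = X[labels == j]
            if len(members) > 0:  # an empty cluster keeps its old center
                centers[j] = members.mean(axis=0)
    cost = ((X - centers[labels]) ** 2).sum()  # k-means objective
    return labels, centers, cost

def pair_agreement(a, b):
    """Fraction of point pairs that two clusterings treat the same way.

    This is the Rand index, used here only as an assumed, simple
    stand-in for a stability score.
    """
    same_a = a[:, None] == a[None, :]
    same_b = b[:, None] == b[None, :]
    upper = np.triu_indices(len(a), k=1)
    return (same_a[upper] == same_b[upper]).mean()

# Hypothetical toy data: three well-separated groups on the real line.
X = np.array([-1.0, 0.0, 1.5, 9.0, 10.0, 11.0, 19.0, 20.0, 21.0]).reshape(-1, 1)

# One initial center per group versus two initial centers in the left group.
labels_a, centers_a, cost_a = lloyd(X, [[0.0], [10.0], [20.0]])
labels_b, centers_b, cost_b = lloyd(X, [[-1.0], [1.5], [15.0]])

print("init A:", labels_a, "cost", cost_a)
print("init B:", labels_b, "cost", cost_b)
print("pairwise agreement:", pair_agreement(labels_a, labels_b))
```

Both runs stop at fixed points of the algorithm, but the second one splits the left group and merges the other two, so the two clusterings differ and the second has a much higher cost. The agreement between clusterings obtained from different initializations is the kind of quantity a stability score summarizes; the precise definition used in the paper is not given in the abstract.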

Bubeck, S., Meilă, M., & Luxburg, U. V. (2012). How the initialization affects the stability of the k-means algorithm. ESAIM - Probability and Statistics, 16, 436–452. https://doi.org/10.1051/ps/2012013
