k-means Requires Exponentially Many Iterations Even in the Plane

162Citations
Citations of this article
124Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

The k-means algorithm is a well-known method for partitioning n points that lie in the d-dimensional space into k clusters. Its main features are simplicity and speed in practice. Theoretically, however, the best known upper bound on its running time (i.e., nO(kd)) is, in general, exponential in the number of points (when kd=Ω(n/log n)). Recently Arthur and Vassilvitskii (Proceedings of the 22nd Annual Symposium on Computational Geometry, pp. 144-153, 2006) showed a super-polynomial worst-case analysis, improving the best known lower bound from Ω(n) to 2Ω(√n) with a construction in d=Ω(√n) dimensions. In Arthur and Vassilvitskii (Proceedings of the 22nd Annual Symposium on Computational Geometry, pp. 144-153, 2006), they also conjectured the existence of super-polynomial lower bounds for any d≥2. Our contribution is twofold: we prove this conjecture and we improve the lower bound, by presenting a simple construction in the plane that leads to the exponential lower bound 2Ω(n). © 2011 The Author(s).

Author supplied keywords

Cite

CITATION STYLE

APA

Vattani, A. (2011). k-means Requires Exponentially Many Iterations Even in the Plane. Discrete and Computational Geometry, 45(4), 596–616. https://doi.org/10.1007/s00454-011-9340-1

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free