Abstract
We introduce a model-free relax-and-round algorithm for k-means clustering based on a semidefinite relaxation due to Peng andWei (2007, SIAM J. Optim., 18, 186-205). The algorithm interprets the output of the semidefinite program as a denoised version of the original data and then rounds this output to a hard clustering.We provide a generic method for proving performance guarantees for this algorithm, and we analyse the algorithm in the context of subgaussian mixture models. We also study the fundamental limits of estimating Gaussian centers by k-means clustering to compare our approximation guarantee to the theoretically optimal k-means clustering solution.
Author supplied keywords
Cite
CITATION STYLE
Mixon, D. G., Villar, S., & Ward, R. (2017). Clustering subgaussian mixtures by semidefinite programming. Information and Inference, 6(4), 389–415. https://doi.org/10.1093/imaiai/iax001
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.