It is challenging to perform k-means clustering on a large scale dataset efficiently. One of the reasons is that k-means needs to scan a batch of training data to update the cluster centers at every iteration, which is time-consuming. In the paper, we propose a variance reduced k-means VRKM, which outperforms the state-of-the-art method, and obtain 4× speedup for large-scale clustering. The source code is available on https://github.com/YaweiZhao/VRKM_sofia-ml.
CITATION STYLE
Zhao, Y., Ming, Y., Liu, X., Zhu, E., & Yin, J. (2018). Variance reduced K-means clustering. In 32nd AAAI Conference on Artificial Intelligence, AAAI 2018 (pp. 8187–8188). AAAI press. https://doi.org/10.1609/aaai.v32i1.12135
Mendeley helps you to discover research relevant for your work.