Abstract
This paper presents an efficient acceleration algorithm for Lloyd-type k-means clustering, which is suitable to a large-scale and highdimensional data set with potentially numerous classes. The algorithm employs a novel projection-based filter (PRJ) to avoid unnecessary distance calculations, resulting in high-speed performance keeping the same results as a standard Lloyd's algorithm. The PRJ exploits a summable lower bound on a squared distance defined in a lower-dimensional space to which data points are projected. The summable lower bound can make the bound tighter dynamically by incremental addition of components in the lowerdimensional space within each iteration although the existing lower bounds used in other acceleration algorithms work only once as a fixed filter. Experimental results on large-scale and high-dimensional real image data sets demonstrate that the proposed algorithm works at high speed and with low memory consumption when large k values are given, compared with the state-of-the-art algorithms.
Author supplied keywords
Cite
CITATION STYLE
Aoyama, K., Saito, K., & Ikeda, T. (2018). Accelerating a lloyd-type k-means clustering algorithm with summable lower bounds in a lower-dimensional space. IEICE Transactions on Information and Systems, E101D(11), 2773–2783. https://doi.org/10.1587/transinf.2017EDP7392
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.