Even Faster Exact k-Means Clustering

Christian Borgelt

Conference Proceedings, Open Access

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 12080 LNCS, pp. 93–105 (2020)

DOI: 10.1007/978-3-030-44584-3_8

Abstract

A naïve implementation of k-means clustering requires computing, for each of the n data points, the distance to each of the k cluster centers, which can result in fairly slow execution. However, by storing distance information obtained in earlier computations as well as information about distances between cluster centers, the triangle inequality can be exploited in different ways to reduce the number of distance computations needed (see, e.g., [3–5, 7, 11]). In this paper I present an improvement of the Exponion method [11] that generally accelerates the computations. Furthermore, by evaluating several methods on a fairly wide range of artificial data sets, I derive a kind of map that shows which method (often) yields the lowest execution times for which data set parameters.
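To make the role of the triangle inequality concrete, the following is a minimal sketch of one assignment step that skips distance computations using center-to-center distances. It is not the Exponion method and not the improvement presented in the paper; it only illustrates the general pruning idea under simple assumptions. The function name `assign_with_pruning` and the returned counter are hypothetical and chosen for illustration.

```python
import math


def assign_with_pruning(points, centers):
    """Assign each point to its nearest center, skipping provably useless
    distance computations (illustrative sketch, not the paper's algorithm).

    Returns (labels, computed), where `computed` counts the point-to-center
    distances that were actually evaluated.
    """
    k = len(centers)
    # Pairwise distances between centers, computed once per assignment step.
    cc = [[math.dist(centers[i], centers[j]) for j in range(k)]
          for i in range(k)]

    labels = []
    computed = 0
    for x in points:
        best = 0
        d_best = math.dist(x, centers[0])
        computed += 1
        for j in range(1, k):
            # Triangle inequality: d(x, c_j) >= d(c_best, c_j) - d(x, c_best).
            # If d(c_best, c_j) >= 2 * d(x, c_best), then d(x, c_j) >= d(x, c_best),
            # so c_j cannot be strictly closer and its distance is not needed.
            if cc[best][j] >= 2.0 * d_best:
                continue
            d = math.dist(x, centers[j])
            computed += 1
            if d < d_best:
                best, d_best = j, d
        labels.append(best)
    return labels, computed


def assign_naive(points, centers):
    """Reference assignment that evaluates all n * k distances."""
    return [min(range(len(centers)), key=lambda j: math.dist(x, centers[j]))
            for x in points]
```

Because a center is skipped only when it cannot be strictly closer than the current best, the result is identical to the naïve assignment; only the number of evaluated distances changes. The savings depend on how well separated the centers are relative to the distances between points and their nearest centers.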

Cite

Borgelt, C. (2020). Even Faster Exact k-Means Clustering. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12080 LNCS, pp. 93–105). Springer. https://doi.org/10.1007/978-3-030-44584-3_8
