K∗-means — A generalized k-means clustering algorithm with unknown cluster number

Yiu Ming Cheung

Conference Proceedings

K∗-means — A generalized k-means clustering algorithm with unknown cluster number

Cheung Y

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2002) 2412 307-317

DOI: 10.1007/3-540-45675-9_48

10Citations

10Readers

Get full text

Abstract

This paper presents a new clustering technique named STep-wise Automatic Rival-penalized (STAR) k-means algorithm (denoted as k∗-means), which is actually a generalized version of the conventional k-means (MacQueen 1967). Not only is this new algorithm applicable to ellipse-shaped data clusters rather than just to ball-shaped ones like the k-means algorithm, but also it can perform appropriate clustering without knowing cluster number by gradually penalizing the winning chance of those extra seed points during learning competition. Although the existing RPCL (Xu et al. 1993) can automatically select the cluster number as well by driving extra seed points far away from the input data set, its performance is much sensitive to the selection of the de-learning rate. To our best knowledge, there is still no theoretical result to guide its selection as yet. In contrast, the proposed k∗-means algorithm need not determine this rate. We have qualitatively analyzed its rival-penalized mechanism with the results well-justified by the experiments.

Cite

CITATION STYLE

APA

Cheung, Y. M. (2002). K∗-means — A generalized k-means clustering algorithm with unknown cluster number. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2412, pp. 307–317). Springer Verlag. https://doi.org/10.1007/3-540-45675-9_48

K∗-means — A generalized k-means clustering algorithm with unknown cluster number

Abstract

Cite

Register to see more suggestions