This paper presents a new clustering technique named STep-wise Automatic Rival-penalized (STAR) k-means algorithm (denoted as k∗-means), which is actually a generalized version of the conventional k-means (MacQueen 1967). Not only is this new algorithm applicable to ellipse-shaped data clusters rather than just to ball-shaped ones like the k-means algorithm, but also it can perform appropriate clustering without knowing cluster number by gradually penalizing the winning chance of those extra seed points during learning competition. Although the existing RPCL (Xu et al. 1993) can automatically select the cluster number as well by driving extra seed points far away from the input data set, its performance is much sensitive to the selection of the de-learning rate. To our best knowledge, there is still no theoretical result to guide its selection as yet. In contrast, the proposed k∗-means algorithm need not determine this rate. We have qualitatively analyzed its rival-penalized mechanism with the results well-justified by the experiments.
CITATION STYLE
Cheung, Y. M. (2002). K∗-means — A generalized k-means clustering algorithm with unknown cluster number. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2412, pp. 307–317). Springer Verlag. https://doi.org/10.1007/3-540-45675-9_48
Mendeley helps you to discover research relevant for your work.