Semi-supervised K-means clustering by optimizing initial cluster centers

9Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Semi-supervised clustering uses a small amount of labeled data to aid and bias the clustering of unlabeled data. This paper explores the usage of labeled data to generate and optimize initial cluster centers for k-means algorithm. It proposes a max-distance search approach in order to find some optimal initial cluster centers from unlabeled data, especially when labeled data can't provide enough initial cluster centers. Experimental results demonstrate the advantages of this method over standard random selection and partial random selection, in which some initial cluster centers come from labeled data while the other come from unlabeled data by random selection. © 2011 Springer-Verlag.

Cite

CITATION STYLE

APA

Wang, X., Wang, C., & Shen, J. (2011). Semi-supervised K-means clustering by optimizing initial cluster centers. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6988 LNCS, pp. 178–187). https://doi.org/10.1007/978-3-642-23982-3_23

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free