Single pass kernel k-means clustering method

T. Hitendra Sarma; P. Viswanath; B. Eswara Reddy

Journal ArticleOPEN ACCESS

Single pass kernel k-means clustering method

Sadhana - Academy Proceedings in Engineering Sciences (2013) 38(3) 407-419

DOI: 10.1007/s12046-013-0143-3

16Citations

12Readers

Get full text

Abstract

In unsupervised classification, kernel k-means clustering method has been shown to perform better than conventional k-means clustering method in identifying non-isotropic clusters in a data set. The space and time requirements of this method are O(n 2), where n is the data set size. Because of this quadratic time complexity, the kernel k-means method is not applicable to work with large data sets. The paper proposes a simple and faster version of the kernel k-means clustering method, called single pass kernel k -means clustering method. The proposed method works as follows. First, a random sample S is selected from the data set D. A partition ΠS is obtained by applying the conventional kernel k-means method on the random sample S. The novelty of the paper is, for each cluster in ΠS, the exact cluster center in the input space is obtained using the gradient descent approach. Finally, each unsampled pattern is assigned to its closest exact cluster center to get a partition of the entire data set. The proposed method needs to scan the data set only once and it is much faster than the conventional kernel k-means method. The time complexity of this method is O(s 2 + t + nk) where s is the size of the random sample S, k is the number of clusters required, and t is the time taken by the gradient descent method (to find exact cluster centers). The space complexity of the method is O(s 2). The proposed method can be easily implemented and is suitable for large data sets, like those in data mining applications. Experimental results show that, with a small loss of quality, the proposed method can significantly reduce the time taken than the conventional kernel k-means clustering method. The proposed method is also compared with other recent similar methods. © 2013 Indian Academy of Sciences.

Author supplied keywords

Cite

CITATION STYLE

APA

Sarma, T. H., Viswanath, P., & Reddy, B. E. (2013). Single pass kernel k-means clustering method. Sadhana - Academy Proceedings in Engineering Sciences, 38(3), 407–419. https://doi.org/10.1007/s12046-013-0143-3

Single pass kernel k-means clustering method

Abstract

Author supplied keywords

Cite

Register to see more suggestions