Spherical k -Means Clustering

Kurt Hornik; Ingo Feinerer; Martin Kober; Christian Buchta

Journal ArticleOPEN ACCESS

Spherical k -Means Clustering

Hornik K
Feinerer I
Kober M
et al.

Journal of Statistical Software (2012) 50(10)

DOI: 10.18637/jss.v050.i10

N/ACitations

124Readers

Abstract

Clustering text documents is a fundamental task in modern data analysis, requiring approaches which perform well both in terms of solution quality and computational efficiency. Spherical k-means clustering is one approach to address both issues, employing cosine dissimilarities to perform prototype-based partitioning of term weight representations of the documents. This paper presents the theory underlying the standard spherical k-means problem and suitable extensions, and introduces the R extension package skmeans which provides a computational environment for spherical k-means clustering featuring several solvers: axed-point and genetic algorithm, and interfaces to two external solvers (CLUTO and Gmeans). Performance of these solvers is investigated by means of a large scale benchmark experiment.

Cite

CITATION STYLE

APA

Hornik, K., Feinerer, I., Kober, M., & Buchta, C. (2012). Spherical k -Means Clustering. Journal of Statistical Software, 50(10). https://doi.org/10.18637/jss.v050.i10

Spherical k -Means Clustering

Abstract

Cite

Register to see more suggestions