We present a package which provides a general framework, including tools and algorithms, for text mining in R using the S4 class system. Using this package and the kernlab R package we explore the use of kernel methods for clustering (e.g., kernel k-means and spectral clustering) on a set of text documents, using string kernels. We compare these methods to a more traditional clustering technique like k-means on a bag of word representation of the text and evaluate the viability of kernel-based methods as a text clustering technique.
CITATION STYLE
Karatzoglou, A., & Feinerer, I. (2007). Text clustering with string kernels in R. In Studies in Classification, Data Analysis, and Knowledge Organization (pp. 91–98). Kluwer Academic Publishers. https://doi.org/10.1007/978-3-540-70981-7_11
Mendeley helps you to discover research relevant for your work.