The document clustering is an important technique of Natural Language Processing (NLP). The paper presents performance of partitional and agglomerative algorithms applied to clustering large number of Polish newspaper articles. We investigate different representations of the documents. The focus of the paper is on the applicability of the Latent Semantic Analysis to such clustering for Polish. © 2010 Springer-Verlag.
CITATION STYLE
Kuta, M., & Kitowski, J. (2010). Clustering Polish texts with latent semantic analysis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6114 LNAI, pp. 532–539). https://doi.org/10.1007/978-3-642-13232-2_65
Mendeley helps you to discover research relevant for your work.