Clustering Polish texts with latent semantic analysis

Marcin Kuta; Jacek Kitowski

Conference Proceedings

Clustering Polish texts with latent semantic analysis

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2010) 6114 LNAI(PART 2) 532-539

DOI: 10.1007/978-3-642-13232-2_65

3Citations

3Readers

Get full text

Abstract

The document clustering is an important technique of Natural Language Processing (NLP). The paper presents performance of partitional and agglomerative algorithms applied to clustering large number of Polish newspaper articles. We investigate different representations of the documents. The focus of the paper is on the applicability of the Latent Semantic Analysis to such clustering for Polish. © 2010 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Kuta, M., & Kitowski, J. (2010). Clustering Polish texts with latent semantic analysis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6114 LNAI, pp. 532–539). https://doi.org/10.1007/978-3-642-13232-2_65

Clustering Polish texts with latent semantic analysis

Abstract

Author supplied keywords

Cite

Register to see more suggestions