Text clustering based on granular computing and Wikipedia

Liping Jing; Jian Yu

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011) 6954 LNAI 679-688

DOI: 10.1007/978-3-642-24425-4_85

Abstract

Text clustering plays an important role in many real-world applications, but it faces several challenges, such as the curse of dimensionality, complex semantics, and large data volumes. Much research has addressed these problems by designing new text representation models and clustering algorithms. However, text clustering remains an open research problem because of the complicated properties of text data. In this paper, a text clustering procedure is proposed based on the principle of granular computing with the aid of Wikipedia. The proposed method first identifies the text granules, focusing on concepts and words, with the aid of Wikipedia. It then mines the latent patterns by computing over these granules. Experimental results on benchmark data sets (20Newsgroups and Reuters-21578) show that the proposed method improves text clustering performance compared with the existing clustering algorithm combined with the existing representation models. © 2011 Springer-Verlag.
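The two-level idea in the abstract (word granules plus concept granules) can be illustrated with a minimal sketch. It is not the authors' algorithm: the `CONCEPT_MAP` dictionary is a hypothetical stand-in for a Wikipedia concept lookup, and the function names, toy documents, and use of cosine similarity are assumptions made only for illustration.

```python
import math
from collections import Counter

# Hypothetical word-to-concept map standing in for a Wikipedia lookup.
CONCEPT_MAP = {
    "puck": "Ice hockey",
    "goalie": "Ice hockey",
    "orbit": "Spaceflight",
    "shuttle": "Spaceflight",
}

def granules(text):
    """Return a bag holding both word granules and concept granules."""
    words = text.lower().split()
    bag = Counter(("word", w) for w in words)
    bag.update(("concept", CONCEPT_MAP[w]) for w in words if w in CONCEPT_MAP)
    return bag

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

docs = ["the goalie stopped it", "a puck slid away", "shuttle reached orbit"]
vectors = [granules(d) for d in docs]

# The first two documents share no words, only the concept "Ice hockey".
sim_hockey = cosine(vectors[0], vectors[1])
# The first and third documents share neither words nor concepts.
sim_mixed = cosine(vectors[0], vectors[2])
```

In this toy setting the first two documents have no word in common, yet they receive a nonzero similarity through the shared concept granule, while the unrelated pair stays at zero. A clustering algorithm run on such vectors could then group documents that a word-only representation would keep apart.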

Citation (APA)

Jing, L., & Yu, J. (2011). Text clustering based on granular computing and Wikipedia. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6954 LNAI, pp. 679–688). https://doi.org/10.1007/978-3-642-24425-4_85
