Text mining with hybrid biclustering algorithms

Patryk Orzechowski; Krzysztof Boryczko

Conference Proceedings

Text mining with hybrid biclustering algorithms

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9693 102-113

DOI: 10.1007/978-3-319-39384-1_9

19Citations

11Readers

Get full text

Abstract

Text data mining is the process of extracting valuable information from a dataset consisting of text documents. Popular clustering algorithms do not allow detection of the same words appearing in multiple documents. Instead, they discover general similarity of such documents. This article presents the application of a hybrid biclustering algorithm for text mining documents collected from Twitter and symbolic analysis of knowledge spreadsheets. The proposed method automatically reveals words appearing together in multiple texts. The proposed approach is compared to some of the most recognized clustering algorithms and shows the advantage of biclustering over clustering in text mining. Finally, the method is confronted with other biclustering methods in the task of classification.

Cite

CITATION STYLE

APA

Orzechowski, P., & Boryczko, K. (2016). Text mining with hybrid biclustering algorithms. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9693, pp. 102–113). Springer Verlag. https://doi.org/10.1007/978-3-319-39384-1_9

Text mining with hybrid biclustering algorithms

Abstract

Cite

Register to see more suggestions