Multilingual documents clustering based on closed concepts mining

Mohamed Chebel; Chiraz Latiri; Eric Gaussier

Conference Proceedings

Multilingual documents clustering based on closed concepts mining

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9261 517-524

DOI: 10.1007/978-3-319-22849-5_36

2Citations

3Readers

Get full text

Abstract

The scarcity of bilingual and multilingual parallel corpora has prompted many researchers to accentuate the need for new methods to enhance the quality of comparable corpora. In this paper, we highlight the interest and usefulness of Formal Concept Analysis in multiligual document clustering to improve corpora comparability. We propose a statistical approach for clustering multiligual documents based on multilingual Closed Concepts Mining to partition the documents belonging to one or more collections, writing in more than one language, in a set of classes. Experimental evaluation was conducted on two collections and showed a significant improvement of comparability of the generated classes.

Cite

CITATION STYLE

APA

Chebel, M., Latiri, C., & Gaussier, E. (2015). Multilingual documents clustering based on closed concepts mining. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9261, pp. 517–524). Springer Verlag. https://doi.org/10.1007/978-3-319-22849-5_36

Multilingual documents clustering based on closed concepts mining

Abstract

Cite

Register to see more suggestions