Fighting with the sparsity of synonymy dictionaries for automatic synset induction

Dmitry Ustalov; Mikhail Chernoskutov; Chris Biemann; Alexander Panchenko

Conference Proceedings

Fighting with the sparsity of synonymy dictionaries for automatic synset induction

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 10716 LNCS 94-105

DOI: 10.1007/978-3-319-73013-4_9

5Citations

5Readers

Get full text

Abstract

Graph-based synset induction methods, such as MaxMax and Watset, induce synsets by performing a global clustering of a synonymy graph. However, such methods are sensitive to the structure of the input synonymy graph: sparseness of the input dictionary can substantially reduce the quality of the extracted synsets. In this paper, we propose two different approaches designed to alleviate the incompleteness of the input dictionaries. The first one performs a pre-processing of the graph by adding missing edges, while the second one performs a post-processing by merging similar synset clusters. We evaluate these approaches on two datasets for the Russian language and discuss their impact on the performance of synset induction methods. Finally, we perform an extensive error analysis of each approach and discuss prominent alternative methods for coping with the problem of sparsity of the synonymy dictionaries.

Author supplied keywords

Cite

CITATION STYLE

APA

Ustalov, D., Chernoskutov, M., Biemann, C., & Panchenko, A. (2018). Fighting with the sparsity of synonymy dictionaries for automatic synset induction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10716 LNCS, pp. 94–105). Springer Verlag. https://doi.org/10.1007/978-3-319-73013-4_9

Fighting with the sparsity of synonymy dictionaries for automatic synset induction

Abstract

Author supplied keywords

Cite

Register to see more suggestions