Combining heterogeneous knowledge resources for improved distributional semantic models

György Szarvas; Torsten Zesch; Iryna Gurevych

Conference Proceedings

Combining heterogeneous knowledge resources for improved distributional semantic models

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011) 6608 LNCS(PART 1) 289-303

DOI: 10.1007/978-3-642-19400-9_23

4Citations

16Readers

Get full text

Abstract

The Explicit Semantic Analysis (ESA) model based on term cooccurrences in Wikipedia has been regarded as state-of-the-art semantic relatedness measure in the recent years. We provide an analysis of the important parameters of ESA using datasets in five different languages. Additionally, we propose the use of ESA with multiple lexical semantic resources thus exploiting multiple evidence of term cooccurrence to improve over the Wikipedia-based measure. Exploiting the improved robustness and coverage of the proposed combination, we report improved performance over single resources in word semantic relatedness, solving word choice problems, classification of semantic relations between nominals, and text similarity. © 2011 Springer-Verlag.

Cite

CITATION STYLE

APA

Szarvas, G., Zesch, T., & Gurevych, I. (2011). Combining heterogeneous knowledge resources for improved distributional semantic models. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6608 LNCS, pp. 289–303). https://doi.org/10.1007/978-3-642-19400-9_23

Combining heterogeneous knowledge resources for improved distributional semantic models

Abstract

Cite

Register to see more suggestions