This paper describes SEW-EMBED, our language-independent approach to multilingual and cross-lingual semantic word similarity for SemEval-2017 Task 2. We leverage the Wikipedia-based concept representations developed by Raganato et al. (2016) and propose an embedded augmentation of their explicit high-dimensional vectors, obtained by plugging in an arbitrary word (or sense) embedding representation and computing a weighted average in the continuous vector space. We evaluate SEW-EMBED with two different off-the-shelf embedding representations and report their performance across all monolingual and cross-lingual benchmarks available for the task. Despite its simplicity, especially compared with supervised or heavily tuned approaches, SEW-EMBED achieves competitive results in the cross-lingual setting (3rd-best result in the global ranking of subtask 2, with a score of 0.56).
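The core idea, collapsing an explicit high-dimensional concept vector (a bag of weighted words) into a dense embedding via a weighted average, can be sketched as follows. This is an illustrative reconstruction, not the authors' code; the function name `embed_concept`, the toy 3-dimensional embeddings, and the weights are all hypothetical.

```python
import numpy as np

def embed_concept(concept_weights, embeddings):
    """Map an explicit (word -> weight) concept vector to a dense vector
    by taking the weighted average of the available word embeddings.
    Words missing from the embedding vocabulary are skipped."""
    vecs, weights = [], []
    for word, w in concept_weights.items():
        if word in embeddings:
            vecs.append(embeddings[word])
            weights.append(w)
    if not vecs:
        return None  # no embeddable words in this concept vector
    return np.average(np.stack(vecs), axis=0, weights=np.array(weights))

# Toy example: two 3-dimensional word embeddings (hypothetical values)
emb = {
    "bank": np.array([1.0, 0.0, 0.0]),
    "river": np.array([0.0, 1.0, 0.0]),
}
# Explicit concept vector; "water" is out-of-vocabulary here and ignored
concept = {"bank": 2.0, "river": 1.0, "water": 0.5}
v = embed_concept(concept, emb)  # weighted average of the two known words
```

Since the averaging happens in the shared embedding space, any off-the-shelf word or sense embedding model can be plugged in unchanged, which is what makes the approach language-independent.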
Delli Bovi, C., & Raganato, A. (2017). SEW-EMBED at SemEval-2017 Task 2: Language-Independent Concept Representations from a Semantically Enriched Wikipedia. In Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017) (pp. 261–266). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/S17-2041