Semantic relatedness for all (Languages): A comparative analysis of multilingual semantic relatedness using machine translation

André Freitas; Siamak Barzegar; Juliano Efson Sales; Siegfried Handschuh; Brian Davis

Conference Proceedings

Semantic relatedness for all (Languages): A comparative analysis of multilingual semantic relatedness using machine translation

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 10024 LNAI 212-222

DOI: 10.1007/978-3-319-49004-5_14

12Citations

19Readers

Get full text

Abstract

This paper provides a comparative analysis of the performance of four state-of-the-art distributional semantic models (DSMs) over 11 languages, contrasting the native language-specific models with the use of machine translation over English-based DSMs. The experimental results show that there is a significant improvement (average of 16.7% for the Spearman correlation) by using state-of-the-art machine translation approaches. The results also show that the benefit of using the most informative corpus outweighs the possible errors introduced by the machine translation. For all languages, the combination of machine translation over the Word2Vec English distributional model provided the best results consistently (average Spearman correlation of 0.68).

Author supplied keywords

Cite

CITATION STYLE

APA

Freitas, A., Barzegar, S., Sales, J. E., Handschuh, S., & Davis, B. (2016). Semantic relatedness for all (Languages): A comparative analysis of multilingual semantic relatedness using machine translation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10024 LNAI, pp. 212–222). Springer Verlag. https://doi.org/10.1007/978-3-319-49004-5_14

Semantic relatedness for all (Languages): A comparative analysis of multilingual semantic relatedness using machine translation

Abstract

Author supplied keywords

Cite

Register to see more suggestions