Towards a universal semantic dictionary

Abstract

This paper proposes a novel method for finding linear mappings among word embeddings of several languages, using a shared, multilingual embedding space as the pivot. Previous approaches learned translation matrices between two specific languages, whereas this method learns translation matrices between a given language and the shared, multilingual space. The system was first trained on bilingual and later also on multilingual corpora. In the bilingual case, two training datasets were used: Dinu's English-Italian benchmark data, and English-Italian translation pairs extracted from the PanLex database. In the multilingual case, only the PanLex database was used. On English-Italian, the system with its best setting performs significantly better than the baseline system of Mikolov, and its performance is comparable to that of more sophisticated systems. By exploiting the richness of the PanLex database, the proposed method makes it possible to learn linear mappings among an arbitrary number of languages.
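
To make the idea of a translation matrix concrete, the following is a minimal sketch of the two-language baseline setting that the abstract contrasts with, not the proposed pivot-space method. It fits a single matrix W by ordinary least squares so that source-language vectors multiplied by W land near the vectors of their translations, then retrieves translations by cosine nearest neighbor. Everything here is an assumption made for illustration: the function names are hypothetical, the embeddings are synthetic random vectors rather than real English or Italian embeddings, and the closed-form least-squares solver stands in for whatever optimizer the original systems used.

```python
import numpy as np


def fit_translation_matrix(src, tgt):
    """Return W minimizing ||src @ W - tgt||_F (ordinary least squares).

    src, tgt: arrays of shape (n_pairs, dim); row i of tgt is the
    embedding of the translation of the word in row i of src.
    """
    W, *_ = np.linalg.lstsq(src, tgt, rcond=None)
    return W


def nearest_neighbors(vecs, candidates):
    """For each row of vecs, index of the most cosine-similar candidate row."""
    a = vecs / np.linalg.norm(vecs, axis=1, keepdims=True)
    b = candidates / np.linalg.norm(candidates, axis=1, keepdims=True)
    return np.argmax(a @ b.T, axis=1)


# Synthetic stand-in data (assumption): the "target language" is an
# almost exactly linear image of the "source language".
rng = np.random.default_rng(0)
n_pairs, dim, n_train = 200, 20, 150
src = rng.normal(size=(n_pairs, dim))
true_map = rng.normal(size=(dim, dim))
tgt = src @ true_map + 0.01 * rng.normal(size=(n_pairs, dim))

# Fit on the first 150 pairs, evaluate retrieval on the held-out 50.
W = fit_translation_matrix(src[:n_train], tgt[:n_train])
pred = nearest_neighbors(src[n_train:] @ W, tgt[n_train:])
precision_at_1 = float(np.mean(pred == np.arange(n_pairs - n_train)))
print(f"held-out precision@1: {precision_at_1:.2f}")
```

In a pivot-based setup as described in the abstract, one such matrix would presumably be learned per language, each mapping into the shared space instead of directly into another language; the details of that training procedure are not given here and are not reproduced by this sketch.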

Cite

Castro-Bleda, M. J., Iklódi, E., Recski, G., & Borbély, G. (2019). Towards a universal semantic dictionary. Applied Sciences (Switzerland), 9(19). https://doi.org/10.3390/app9194060
