Simple task-specific bilingual word embeddings

Stephan Gouws; Anders Søgaard

Conference ProceedingsOPEN ACCESS

Simple task-specific bilingual word embeddings

NAACL HLT 2015 - 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (2015) 1386-1390

DOI: 10.3115/v1/n15-1157

99Citations

158Readers

Abstract

We introduce a simple wrapper method that uses off-the-shelf word embedding algorithms to learn task-specific bilingual word embeddings. We use a small dictionary of easily-obtainable task-specific word equivalence classes to produce mixed context-target pairs that we use to train off-the-shelf embedding models. Our model has the advantage that it (a) is independent of the choice of embedding algorithm, (b) does not require parallel data, and (c) can be adapted to specific tasks by re-defining the equivalence classes. We show how our method outperforms off-the-shelf bilingual embeddings on the task of unsupervised cross-language part-of-speech (POS) tagging, as well as on the task of semi-supervised cross-language super sense (SuS) tagging.

Cite

CITATION STYLE

APA

Gouws, S., & Søgaard, A. (2015). Simple task-specific bilingual word embeddings. In NAACL HLT 2015 - 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (pp. 1386–1390). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/n15-1157

Simple task-specific bilingual word embeddings

Abstract

Cite

Register to see more suggestions