Abstract
We replicate the syntactic experiments of Mikolov et al. (2013b) on English, and expand them to include morphologically complex languages. We learn vector representations for Dutch, French, German, and Spanish with the WORD2VEC tool, and investigate to what extent inflectional information is preserved across vectors. We observe that the accuracy of vectors on a set of syntactic analogies is inversely correlated with the morphological complexity of the language.
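The syntactic analogies mentioned above are typically posed as "a is to b as c is to ?" and answered with vector arithmetic. The following is a minimal sketch, assuming the usual vector-offset formulation with cosine similarity. The function names and the two-dimensional toy vectors are hypothetical, invented for illustration, and are not taken from the paper or from the WORD2VEC tool.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def solve_analogy(vectors, a, b, c):
    """Answer 'a : b :: c : ?' by finding the word closest to vec(b) - vec(a) + vec(c).

    The three question words are excluded from the candidates.
    """
    target = [vb - va + vc for va, vb, vc in zip(vectors[a], vectors[b], vectors[c])]
    candidates = (w for w in vectors if w not in (a, b, c))
    return max(candidates, key=lambda w: cosine(target, vectors[w]))


def analogy_accuracy(vectors, questions):
    """Fraction of (a, b, c, expected) questions answered correctly."""
    correct = sum(solve_analogy(vectors, a, b, c) == d for a, b, c, d in questions)
    return correct / len(questions)


# Hypothetical toy vectors: the second dimension stands in for "past tense".
toy_vectors = {
    "walk": [1.0, 0.0],
    "walked": [1.0, 1.0],
    "jump": [2.0, 0.0],
    "jumped": [2.0, 1.0],
    "table": [0.0, -1.0],
}
questions = [("walk", "walked", "jump", "jumped")]

print(solve_analogy(toy_vectors, "walk", "walked", "jump"))
print(analogy_accuracy(toy_vectors, questions))
```

In a real setting the dictionary would hold learned vectors for a full vocabulary, and accuracy would be computed per language over a set of inflectional analogy questions.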
Cite
Nicolai, G., Cherry, C., & Kondrak, G. (2015). Morpho-syntactic regularities in continuous word representations: A multilingual study. In 1st Workshop on Vector Space Modeling for Natural Language Processing, VS 2015 at the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2015 (pp. 129–134). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/w15-1518