Morphological smoothing and extrapolation of word embeddings

Citations: 46
Readers: 121 (Mendeley)

Abstract

Languages with rich inflectional morphology exhibit lexical data sparsity, since the word used to express a given concept will vary with the syntactic context. For instance, each count noun in Czech has 12 forms (where English uses only singular and plural). Even in large corpora, we are unlikely to observe all inflections of a given lemma. This reduces the vocabulary coverage of methods that induce continuous representations for words from distributional corpus information. We solve this problem by exploiting existing morphological resources that can enumerate a word's component morphemes. We present a latent-variable Gaussian graphical model that allows us to extrapolate continuous representations for words not observed in the training corpus, as well as to smooth the representations provided for the observed words. The latent variables represent embeddings of morphemes, which combine to create embeddings of words. Over several languages and training sizes, our model improves word embeddings, as evaluated on an analogy task, skip-gram predictive accuracy, and word similarity.
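To make the abstract's model concrete, here is a minimal sketch (not the authors' code) of one simple special case: assume each observed word embedding is the sum of its morphemes' latent embeddings plus isotropic Gaussian noise, with Gaussian priors on the morpheme embeddings. Under those assumptions, MAP inference for the morpheme embeddings reduces to ridge regression. The toy vocabulary, morphological analyses, and the DIM and LAMBDA constants are all invented for illustration.

```python
# Sketch only: additive Gaussian model over morpheme embeddings.
# word embedding = sum of morpheme embeddings + Gaussian noise,
# Gaussian prior on morphemes  =>  MAP estimate is a ridge solution.
import numpy as np

# Toy morphological analyses: word -> component morphemes (illustrative).
ANALYSES = {
    "walks":   ["walk", "+3sg"],
    "walked":  ["walk", "+past"],
    "talked":  ["talk", "+past"],
    "talking": ["talk", "+prog"],
}
MORPHEMES = sorted({m for ms in ANALYSES.values() for m in ms})
WORDS = sorted(ANALYSES)

# Observed (noisy) word embeddings, e.g. from skip-gram; random stand-ins here.
rng = np.random.default_rng(0)
DIM = 5
X = rng.normal(size=(len(WORDS), DIM))

# Indicator matrix M: M[i, j] = 1 iff morpheme j appears in word i.
M = np.zeros((len(WORDS), len(MORPHEMES)))
for i, w in enumerate(WORDS):
    for m in ANALYSES[w]:
        M[i, MORPHEMES.index(m)] = 1.0

# MAP morpheme embeddings A: argmin_A ||M A - X||^2 + LAMBDA * ||A||^2,
# solved in closed form (ridge regression).
LAMBDA = 0.1
A = np.linalg.solve(M.T @ M + LAMBDA * np.eye(len(MORPHEMES)), M.T @ X)

# Smoothing: reconstructed embeddings for the observed words.
X_smoothed = M @ A

# Extrapolation: an unseen form "walking" = walk + +prog, where both
# morphemes were learned from other words in the toy training set.
unseen_analysis = ["walk", "+prog"]
x_new = sum(A[MORPHEMES.index(m)] for m in unseen_analysis)
print("extrapolated embedding for 'walking':", x_new)
```

In this reading, smoothing corresponds to the reconstructed rows of M @ A for observed words, while extrapolation builds an embedding for an unseen inflection by summing morpheme vectors learned from the words that were observed.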

Cite (APA)

Cotterell, R., Schütze, H., & Eisner, J. (2016). Morphological smoothing and extrapolation of word embeddings. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp. 1651–1660). Association for Computational Linguistics. https://doi.org/10.18653/v1/P16-1156
