Languages with productive morphology pose problems for language models that generate words from a fixed vocabulary. Although character-based models allow any possible word type to be generated, they are linguistically naïve: they must discover that words exist and are delimited by spaces, basic linguistic facts that are built into the structure of word-based models. We introduce an open-vocabulary language model that incorporates more sophisticated linguistic knowledge by predicting words using a mixture of three generative processes: (1) generating words as a sequence of characters, (2) directly generating full word forms, and (3) generating words as a sequence of morphemes that are combined using a hand-written morphological analyzer. Experiments on Finnish, Turkish, and Russian show that our model outperforms character-sequence models and other strong baselines on intrinsic and extrinsic measures. Furthermore, we show that our model learns to exploit the morphological knowledge encoded in the analyzer and, as a byproduct, can perform effective unsupervised morphological disambiguation.
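To make the mixture of three generative processes concrete, the following minimal Python sketch shows how a word's probability could be computed by interpolating character-level, full-word, and morpheme-level log-probabilities. This is an illustrative assumption rather than the authors' implementation: the function name, toy values, and fixed gate logits are hypothetical, and in the actual model the mixture weights and component probabilities would be predicted by the network from context.

    import math

    def mixture_word_logprob(char_logprob, word_logprob, morph_logprob, gate_logits):
        # Turn the three gate logits into normalized mixture weights (log-space softmax).
        m = max(gate_logits)
        z = sum(math.exp(g - m) for g in gate_logits)
        log_weights = [g - m - math.log(z) for g in gate_logits]

        # log p(w) = logsumexp_k( log weight_k + log p_k(w) ), one term per process.
        component_logprobs = (char_logprob, word_logprob, morph_logprob)
        scores = [lw + lp for lw, lp in zip(log_weights, component_logprobs)]
        s = max(scores)
        return s + math.log(sum(math.exp(x - s) for x in scores))

    # Toy usage: a rare inflected form is unlikely as a memorized full word,
    # but plausible under the character- and morpheme-level processes.
    print(mixture_word_logprob(char_logprob=-12.0, word_logprob=-25.0,
                               morph_logprob=-9.5, gate_logits=[0.2, 0.1, 0.7]))

Because the mixture is a sum over components, a word that any one process can explain well (e.g. a novel inflection handled by the morphological analyzer) still receives reasonable probability, which is the motivation for combining the three processes.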
CITATION STYLE
Matthews, A., Neubig, G., & Dyer, C. (2018). Using morphological knowledge in open-vocabulary neural language models. In Proceedings of NAACL-HLT 2018 (Vol. 1, pp. 1435–1445). Association for Computational Linguistics. https://doi.org/10.18653/v1/n18-1130