Minimally-Supervised Morphological Segmentation using Adaptor Grammars with Linguistic Priors

Ramy Eskander; Cass Lowry; Sujay Khandagale; Francesca Callejas; Judith Klavans; Maria Polinsky; Smaranda Muresan

Conference ProceedingsOPEN ACCESS

Minimally-Supervised Morphological Segmentation using Adaptor Grammars with Linguistic Priors

Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (2021) 3969-3974

DOI: 10.18653/v1/2021.findings-acl.347

2Citations

49Readers

Abstract

With the increasing interest in low-resource languages, unsupervised morphological segmentation has become an active area of research, where approaches based on Adaptor Grammars achieve state-of-the-art results. We demonstrate the power of harnessing linguistic knowledge as priors within Adaptor Grammars in a minimally-supervised learning fashion. We introduce two types of priors: 1) grammar definition, where we design language-specific grammars; and 2) linguist-provided affixes, collected by an expert in the language and seeded into the grammars. We use Japanese and Georgian as respective case studies for the two types of priors and introduce new datasets for these languages, with gold morphological segmentation for evaluation. We show that the use of priors results in error reductions of 8.9 % and 34.2 %, respectively, over the equivalent state-of-the-art unsupervised system.

Cite

CITATION STYLE

APA

Eskander, R., Lowry, C., Khandagale, S., Callejas, F., Klavans, J., Polinsky, M., & Muresan, S. (2021). Minimally-Supervised Morphological Segmentation using Adaptor Grammars with Linguistic Priors. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (pp. 3969–3974). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.findings-acl.347

Minimally-Supervised Morphological Segmentation using Adaptor Grammars with Linguistic Priors

Abstract

Cite

Register to see more suggestions