To compress or not to compress? A finite-state approach to nen verbal morphology

4Citations
Citations of this article
62Readers
Mendeley users who have this article in their library.

Abstract

This paper describes the development of a verbal morphological parser for an under-resourced Papuan language, Nen. Nen verbal morphology is particularly complex, with a transitive verb taking up to 1, 740 unique features. The structural properties exhibited by Nen verbs raises interesting choices for analysis. Here we compare two possible methods of analysis: ‘Chunking’ and decomposition. ‘Chunking’ refers to the concept of collating morphological segments into one, whereas the decomposition model follows a more classical linguistic approach. Both models are built using the Finite-State Transducer toolkit foma. The resultant architecture shows differences in size and structural clarity. While the ‘Chunking’ model is under half the size of the full decomposed counterpart, the decomposition displays higher structural order. In this paper, we describe the challenges encountered when modelling a language exhibiting distributed exponence and present the first morphological analyser for Nen, with an overall accuracy of 80.3%.

Cite

CITATION STYLE

APA

Muradoglu, S., Evans, N., & Suominen, H. (2020). To compress or not to compress? A finite-state approach to nen verbal morphology. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 207–213). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.acl-srw.28

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free