Generalized Glossing Guidelines: An Explicit, Human- and Machine-Readable, Item-and-Process Convention for Morphological Annotation

6Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We introduce a YAML notation for multi-line interlinear glossed text (IGT) that represents non-concatentative processes such as infixation, reduplication, mutation, truncation, and tonal overwriting in a consistent, formally rigorous way, on par with affixation, using an Item-and-Process (IP) framework. Our new notation - Generalized Glossing Guidelines (GGG) - is human- and machine-readable and easy to edit with general purpose tools. A GGG representation has four fields: (1) A Surface Representation (sr) with curly brackets to show where non-concatenative morphological processes have applied. (2) A Lexical Representation (lx) that explicitly shows nonconcatenative processes as insertions, deletions, and substitutions as they apply to the basic form of morphemes. (3) A gloss field (gl) that associates glosses with morphemes and morphological processes in the sr and lx lines. (4) A metalanguage translation. We demonstrate the linguistic adequacy of GGG and compare it to two other IGT annotation schemes.

Cite

CITATION STYLE

APA

Mortensen, D. R., Gulsen, E., He, T., Robinson, N., Amith, J. D., Tjuatja, L., & Levin, L. (2023). Generalized Glossing Guidelines: An Explicit, Human- and Machine-Readable, Item-and-Process Convention for Morphological Annotation. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 58–67). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.sigmorphon-1.7

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free