Morphosyntactic analyzer for the Tibetan language: Aspects of structural ambiguity

Alexei Dobrov; Anastasia Dobrova; Pavel Grokhovskiy; Nikolay Soms; Victor Zakharov

Book Chapter

Morphosyntactic analyzer for the Tibetan language: Aspects of structural ambiguity

Springer Verlag, (2016), 215-222

DOI: 10.1007/978-3-319-45510-5_25

10Citations

1Readers

Get full text

Abstract

The paper deals with the development of a morphosyntactic analyzer for the Tibetan language. It aims to create a consistent formal grammatical description (formal grammar) of the Tibetan language, including all grammar levels of the language system from morphosyntax (syntactics of morphemes) to the syntax of composite sentences and supra-phrasal entities. Syntactic annotation was created on the basis of morphologically tagged corpora of Tibetan texts. The peculiarity of the annotation consists in combining both the immediate constituents structure and the dependency one. An individual (basic) grammar module of Tibetan grammatical categories, its possible values, and restrictions on their combination are created. Types of tokens and their grammatical features form the basis of the formal grammar being produced, allowing linguistic processor to build syntactic trees of various kinds. Methods of avoiding redundant structural ambiguity are proposed.

Cite

CITATION STYLE

APA

Dobrov, A., Dobrova, A., Grokhovskiy, P., Soms, N., & Zakharov, V. (2016). Morphosyntactic analyzer for the Tibetan language: Aspects of structural ambiguity. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9924 LNCS, pp. 215–222). Springer Verlag. https://doi.org/10.1007/978-3-319-45510-5_25

Morphosyntactic analyzer for the Tibetan language: Aspects of structural ambiguity

Abstract

Cite

Register to see more suggestions