Morphosyntactic analyzer for the Tibetan language: Aspects of structural ambiguity

10Citations
Citations of this article
1Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The paper deals with the development of a morphosyntactic analyzer for the Tibetan language. It aims to create a consistent formal grammatical description (formal grammar) of the Tibetan language, including all grammar levels of the language system from morphosyntax (syntactics of morphemes) to the syntax of composite sentences and supra-phrasal entities. Syntactic annotation was created on the basis of morphologically tagged corpora of Tibetan texts. The peculiarity of the annotation consists in combining both the immediate constituents structure and the dependency one. An individual (basic) grammar module of Tibetan grammatical categories, its possible values, and restrictions on their combination are created. Types of tokens and their grammatical features form the basis of the formal grammar being produced, allowing linguistic processor to build syntactic trees of various kinds. Methods of avoiding redundant structural ambiguity are proposed.

Cite

CITATION STYLE

APA

Dobrov, A., Dobrova, A., Grokhovskiy, P., Soms, N., & Zakharov, V. (2016). Morphosyntactic analyzer for the Tibetan language: Aspects of structural ambiguity. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9924 LNCS, pp. 215–222). Springer Verlag. https://doi.org/10.1007/978-3-319-45510-5_25

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free