Enriching, editing, and representing interlinear glossed text

Fei Xia; Michael Wayne Goodman; Ryan Georgi; Glenn Slayden; William D. Lewis

Conference Proceedings

Enriching, editing, and representing interlinear glossed text

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9041 32-46

DOI: 10.1007/978-3-319-18111-0_3

0Citations

1Readers

Get full text

Abstract

The majority of the world’s languages have little to no NLP resources or tools. This is due to a lack of training data (“resources”) over which tools, such as taggers or parsers, can be trained. In recent years, there have been increasing efforts to apply NLP methods to a much broader swathe of the worlds languages. In many cases this involves bootstrapping the learning process with enriched or partially enriched resources. One promising line of research involves the use of Interlinear Glossed Text (IGT), a very common form of annotated data used in the field of linguistics. Although IGT is generally very richly annotated, and can be enriched even further (e.g., through structural projection), much of the content is not easily consumable by machines since it remains “trapped” in linguistic scholarly documents and in human readable form. In this paper, we introduce several tools that make IGT more accessible and consumable by NLP researchers.

Cite

CITATION STYLE

APA

Xia, F., Goodman, M. W., Georgi, R., Slayden, G., & Lewis, W. D. (2015). Enriching, editing, and representing interlinear glossed text. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9041, pp. 32–46). Springer Verlag. https://doi.org/10.1007/978-3-319-18111-0_3

Enriching, editing, and representing interlinear glossed text

Abstract

Cite

Register to see more suggestions