Towards a General-Purpose Linguistic Annotation Backend

  • Neubig G
  • Littell P
  • Chen C
  • et al.

Abstract

Language documentation is inherently time-intensive: transcription, glossing, and corpus management consume a significant portion of documentary linguists’ work. Advances in natural language processing (NLP) can accelerate this work by using the linguists’ past decisions as training material, but questions remain about how to prioritize human involvement. In this extended abstract, we describe the beginnings of a new project that will attempt to ease the language documentation process through the use of NLP technology. The project rests on (1) methods for adapting NLP tools to new languages, building on recent advances in massively multilingual neural networks, and (2) backend APIs and interfaces that allow linguists to upload their data (§2). We then describe our current progress on two fronts: automatic phoneme transcription and glossing (§3). Finally, we briefly describe our future directions (§4).

Cite

Neubig, G., Littell, P., Chen, C.-Y., Lee, J., Li, Z., Lin, Y.-H., & Zhang, Y. (2019). Towards a General-Purpose Linguistic Annotation Backend. Proceedings of the Workshop on Computational Methods for Endangered Languages, 2(1). https://doi.org/10.33011/computel.v2i.437
