This paper describes the first, three-year phase of a project at the National Research Council of Canada that creates software to assist Indigenous communities in preserving their languages and extending their use. The project aimed to work within the empowerment paradigm, where collaboration with communities and fulfillment of their goals is central. Since many of the technologies we developed were in response to community needs, the project ended up as a collection of diverse subprojects, including the creation of a sophisticated framework for building verb conjugators for highly inflectional polysynthetic languages (such as Kanyen’kéha, in the Iroquoian language family), release of what is probably the largest available corpus of sentences in a polysynthetic language (Inuktut) aligned with English sentences and experiments with machine translation (MT) systems trained on this corpus, free online services based on automatic speech recognition (ASR) for easing the transcription bottleneck for speech recordings, software for implementing text prediction and read-along audiobooks for Indigenous languages, and several other subprojects.
CITATION STYLE
Kuhn, R., Davis, F., Désilets, A., Joanis, E., Kazantseva, A., Knowles, R., … Souter, H. (2020). The Indigenous Languages Technology project at NRC Canada: An empowerment-oriented approach to developing language software. In COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference (pp. 5866–5878). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.coling-main.516
Mendeley helps you to discover research relevant for your work.