Requirements and Motivations of Low-Resource Speech Synthesis for Language Revitalization

28Citations
Citations of this article
54Readers
Mendeley users who have this article in their library.

Abstract

This paper describes the motivation and development of speech synthesis systems for the purposes of language revitalization. By building speech synthesis systems for three Indigenous languages spoken in Canada, Kanien'kéha, Gitksan & SENCOTEN, we re-evaluate the question of how much data is required to build low-resource speech synthesis systems featuring state-of-the-art neural models. For example, preliminary results with English data show that a FastSpeech2 model trained with 1 hour of training data can produce speech with comparable naturalness to a Tacotron2 model trained with 10 hours of data. Finally, we motivate future research in evaluation and classroom integration in the field of speech synthesis for language revitalization.

Cite

CITATION STYLE

APA

Pine, A., Wells, D., Brinklow, N. T., Littell, P., & Richmond, K. (2022). Requirements and Motivations of Low-Resource Speech Synthesis for Language Revitalization. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1, pp. 7346–7359). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.acl-long.507

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free