Design of a yoruba language speech corpus for the purposes of text-to-speech (TTS) synthesis

Théophile K. Dagba; John O.R. Aoga; Codjo C. Fanou

Conference Proceedings

Design of a yoruba language speech corpus for the purposes of text-to-speech (TTS) synthesis

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9621 161-169

DOI: 10.1007/978-3-662-49381-6_16

4Citations

4Readers

Get full text

Abstract

This paper deals with the design of a speech corpus for a corpus-based Text-To-Speech (TTS) synthesis approach. The purposes are first to provide enough speech to develop Yoruba corpus-based TTS system and second, to provide a simple methodology for other languages corpus design. The paper focuses on text analysis, selection of the reliable sentences, selection of the reader, and sentences recording. The analysis is performed to ensure a good balance of the corpus. Then, 2,415 sentences are gathered (essentially affirmative sentences). Those sentences have been read by a Yoruba language journalist who is a native speaker of the language. There is one speaker for the whole corpus.

Author supplied keywords

Cite

CITATION STYLE

APA

Dagba, T. K., Aoga, J. O. R., & Fanou, C. C. (2016). Design of a yoruba language speech corpus for the purposes of text-to-speech (TTS) synthesis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9621, pp. 161–169). Springer Verlag. https://doi.org/10.1007/978-3-662-49381-6_16

Design of a yoruba language speech corpus for the purposes of text-to-speech (TTS) synthesis

Abstract

Author supplied keywords

Cite

Register to see more suggestions