This paper deals with the design of a speech corpus for a corpus-based Text-To-Speech (TTS) synthesis approach. The purposes are first to provide enough speech to develop Yoruba corpus-based TTS system and second, to provide a simple methodology for other languages corpus design. The paper focuses on text analysis, selection of the reliable sentences, selection of the reader, and sentences recording. The analysis is performed to ensure a good balance of the corpus. Then, 2,415 sentences are gathered (essentially affirmative sentences). Those sentences have been read by a Yoruba language journalist who is a native speaker of the language. There is one speaker for the whole corpus.
CITATION STYLE
Dagba, T. K., Aoga, J. O. R., & Fanou, C. C. (2016). Design of a yoruba language speech corpus for the purposes of text-to-speech (TTS) synthesis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9621, pp. 161–169). Springer Verlag. https://doi.org/10.1007/978-3-662-49381-6_16
Mendeley helps you to discover research relevant for your work.