Improving speech synthesis quality for voices created from an audiobook database

Abstract

This paper describes an approach to improving synthesized speech quality for voices created from an audiobook database. The data consist of a large amount of read speech by a single speaker, which we matched with the corresponding book texts. Such a database poses two main problems. First, the recordings were made at different times under different acoustic conditions, and the speaker reads the text with a variety of intonations and accents, which leads to very high voice parameter variability. Second, automatic sound file labeling techniques make more errors because of this large variability, especially since there can be mismatches between the text and the corresponding sound files. These problems dramatically affect speech synthesis quality, so a robust method for solving them is vital for voices created from audiobooks. The approach described in the paper is based on statistical models of voice parameters and special algorithms for speech element concatenation and modification. Listening tests show that it significantly improves synthesized speech quality.
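The abstract does not spell out the algorithms, but as a rough illustration of how statistical models of voice parameters can help screen a highly variable audiobook corpus, the sketch below fits a simple per-speaker Gaussian to per-unit acoustic features and flags outlier units (e.g. mislabeled or acoustically mismatched segments) before unit selection. The feature set (mean F0, log energy, duration), the diagonal Gaussian, and the z-score threshold are illustrative assumptions, not details taken from the paper.

```python
import numpy as np

# Illustrative sketch only: flag speech units whose acoustic parameters
# deviate strongly from a per-speaker Gaussian model, as one simple way
# to cope with high parameter variability and labeling errors in an
# audiobook corpus. Feature choice and threshold are assumptions.

def fit_voice_model(features: np.ndarray):
    """Fit a diagonal Gaussian to per-unit features
    (rows = units, columns = e.g. mean F0, log energy, duration)."""
    mean = features.mean(axis=0)
    std = features.std(axis=0) + 1e-8  # avoid division by zero
    return mean, std

def flag_outlier_units(features: np.ndarray, mean, std, z_thresh=3.0):
    """Return a boolean mask marking units whose z-score exceeds the
    threshold in any dimension; such units could be excluded from
    (or penalized in) concatenative unit selection."""
    z = np.abs((features - mean) / std)
    return (z > z_thresh).any(axis=1)

if __name__ == "__main__":
    # Example with random stand-in data: 1000 units, 3 features each.
    rng = np.random.default_rng(0)
    units = rng.normal(loc=[120.0, -2.0, 0.08],
                       scale=[15.0, 0.5, 0.02],
                       size=(1000, 3))
    units[:5] += [200.0, 3.0, 0.3]  # simulate mislabeled/outlier units
    mean, std = fit_voice_model(units)
    mask = flag_outlier_units(units, mean, std)
    print(f"Flagged {mask.sum()} of {len(units)} units as outliers")
```

In practice a more elaborate statistical model and additional cues (e.g. alignment confidence between text and audio) would be used, but the filtering idea is the same: downweight or discard units whose parameters are inconsistent with the speaker's overall voice model.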

Citation (APA)

Chistikov, P., Zakharov, D., & Talanov, A. (2014). Improving speech synthesis quality for voices created from an audiobook database. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8773, pp. 276–283). Springer Verlag. https://doi.org/10.1007/978-3-319-11581-8_34
