Comparing qmt1 and HMMS for the synthesis of American English prosody

Sacha Krstulović; Javier Latorre; Sabine Buchholz

Conference Proceedings

Comparing qmt1 and HMMS for the synthesis of American English prosody

Proceedings of the 4th International Conference on Speech Prosody, SP 2008 (2008) 67-70

DOI: 10.21437/speechprosody.2008-15

7Citations

12Readers

Get full text

Abstract

Three models are compared for the duration and pitch contour of American English in a speech synthesis framework. These models combine duration prediction by Quantification Method Type 1 (QMT1), a Codebook-based method for the F0 contour and a Hidden Markov Model-based method for both durations and F0. Subjective listening tests show that the HMMs are preferred over the Codebook for the F0 contour, but that their duration modelling performances are not significantly different from those of QMT1 in the tested setup. An analysis of naive freeform listener comments supports this fact, and suggests that such comments can give useful hints regarding the performance of each system.

Cite

CITATION STYLE

APA

Krstulović, S., Latorre, J., & Buchholz, S. (2008). Comparing qmt1 and HMMS for the synthesis of American English prosody. In Proceedings of the 4th International Conference on Speech Prosody, SP 2008 (pp. 67–70). International Speech Communication Association. https://doi.org/10.21437/speechprosody.2008-15

Comparing qmt1 and HMMS for the synthesis of American English prosody

Abstract

Cite

Register to see more suggestions