Disfluent speech analysis and synthesis: A preliminary approach

Jordi Adell; Antonio Bonafonte; David Escudero

Conference ProceedingsOPEN ACCESS

Disfluent speech analysis and synthesis: A preliminary approach

Proceedings of the International Conference on Speech Prosody (2006)

DOI: 10.21437/speechprosody.2006-120

14Citations

18Readers

Abstract

Despite of the existence of high quality unit selection speech synthesizers, they are based on a reading style approach. However, new applications such as Speech-to-Speech Translation or Speech User Interfaces demand a talking style which is more natural in these contexts. Disfluencies are a major characteristic of talking style so that it is convenient to be able to generate disfluent speech. In the present paper a preliminary analysis of pitch and segmental duration in repetitions and filled pauses is presented. Simple rules to predict these prosodic features are derived from the previous analysis and used for synthesis. Evaluation shows an increase in naturalness while overall quality is decreased.

Cite

CITATION STYLE

APA

Adell, J., Bonafonte, A., & Escudero, D. (2006). Disfluent speech analysis and synthesis: A preliminary approach. In Proceedings of the International Conference on Speech Prosody. International Speech Communication Association. https://doi.org/10.21437/speechprosody.2006-120

Disfluent speech analysis and synthesis: A preliminary approach

Abstract

Cite

Register to see more suggestions