Nonlinear speech features for the objective detection of discontinuities in concatenative speech synthesis

Yannis Pantazis; Yannis Stylianou

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2005) 3445 LNAI 375-383

DOI: 10.1007/11520153_21

Abstract

An objective distance measure that can predict audible discontinuities is very important for concatenative speech synthesis systems. Previous results showed that linear approaches are not very effective at detecting audible discontinuities: the best result, obtained with the Kullback-Leibler distance on power spectra, was a detection rate of 37%. In this paper, we present two nonlinear approaches for the detection of discontinuities. The first method is based on a nonlinear harmonic model for speech, while the second is based on the demodulation of speech into an amplitude component and a frequency component using the Teager energy operator. Results show that the detection rate can exceed 70%, a relative improvement of about 95% over previously published results. © Springer-Verlag Berlin Heidelberg 2005.
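To make the second approach more concrete, the sketch below shows how the discrete Teager energy operator can separate a signal into amplitude and frequency estimates. The abstract does not say which demodulation algorithm the authors use; DESA-2, a standard energy separation algorithm, is assumed here purely for illustration, and the function names `teager_energy` and `desa2` are hypothetical.

```python
import math


def teager_energy(x):
    """Discrete Teager energy: psi[x](n) = x(n)^2 - x(n-1) * x(n+1).

    Returns len(x) - 2 values, one per interior sample n = 1 .. len(x) - 2.
    """
    return [x[n] * x[n] - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]


def desa2(x):
    """DESA-2 estimates of instantaneous amplitude and frequency.

    Assumption: DESA-2 is used as an illustrative demodulation scheme; it is
    not confirmed to be the algorithm used in the paper.
    Frequencies are in radians per sample. Both returned lists cover the
    samples n = 2 .. len(x) - 3.
    """
    psi_x = teager_energy(x)  # psi_x[i] belongs to sample n = i + 1
    # Symmetric difference y(n) = x(n+1) - x(n-1); y[i] belongs to n = i + 1.
    y = [x[n + 1] - x[n - 1] for n in range(1, len(x) - 1)]
    psi_y = teager_energy(y)  # psi_y[j] belongs to sample n = j + 2

    amps, freqs = [], []
    for j, energy_y in enumerate(psi_y):
        energy_x = psi_x[j + 1]  # same sample n = j + 2
        if energy_x <= 0.0 or energy_y <= 0.0:
            # The estimates are undefined where either energy is not positive.
            amps.append(math.nan)
            freqs.append(math.nan)
            continue
        # Clamp to the valid arccos domain to guard against rounding error.
        c = max(-1.0, min(1.0, 1.0 - energy_y / (2.0 * energy_x)))
        freqs.append(0.5 * math.acos(c))
        amps.append(2.0 * energy_x / math.sqrt(energy_y))
    return amps, freqs


# Usage: a pure sinusoid with amplitude 0.5 and frequency 0.3 rad/sample.
signal = [0.5 * math.cos(0.3 * n + 0.1) for n in range(50)]
amplitudes, frequencies = desa2(signal)
```

For a pure sinusoid A cos(Ωn + φ) the operator output is the constant A² sin²(Ω), so the estimates recover A and Ω exactly, up to rounding, for Ω below π/2. A discontinuity detector along these lines might compare such amplitude and frequency tracks on either side of a concatenation point, but the distance measure actually used is not described in the abstract.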

Cite (APA)

Pantazis, Y., & Stylianou, Y. (2005). Nonlinear speech features for the objective detection of discontinuities in concatenative speech synthesis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3445 LNAI, pp. 375–383). Springer Verlag. https://doi.org/10.1007/11520153_21