In this paper we present a speech analysis/synthesis coder that combines linear prediction with nonlinear modeling of the residual using a regularized radial basis function (RBF) network. The model has been applied to the synthesis of sustained vowel signals and has been found to preserve the dynamics and spectra of the original speech signal. While several nonlinear speech models reportedly suffer from high-frequency losses in the synthesized speech due to system-inherent low-pass behavior, our approach achieves good speech signal reproduction even in the higher frequency ranges. The decomposition of the speech signal by linear prediction analysis supports processing during synthesis, such as pitch modification, while the nonlinear modeling provides the means for adequate reproduction of the fine-grained dynamic characteristics of speech. © Springer-Verlag Berlin Heidelberg 2001.
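The two-stage idea in the abstract (linear prediction first, then a regularized RBF network on the residual) can be illustrated with a minimal sketch. This is not the authors' implementation: the autocorrelation-method LP solver, the delay embedding of the residual, the Gaussian basis functions, the ridge-style regularizer, and all function and parameter names (`lpc_coefficients`, `delay_embed`, `fit_rbf`, `lam`, `width`) are assumptions chosen for illustration only.

```python
import numpy as np

def lpc_coefficients(x, order):
    """Autocorrelation-method LP (assumed here): solve R a = r for the predictor a."""
    r = np.correlate(x, x, mode="full")[len(x) - 1 : len(x) + order]
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R = R + 1e-9 * r[0] * np.eye(order)  # tiny diagonal loading for numerical safety
    return np.linalg.solve(R, r[1 : order + 1])

def lp_residual(x, a):
    """Inverse filtering: e[n] = x[n] - sum_k a[k] * x[n-1-k] (zero initial state)."""
    e = x.copy()
    for k in range(len(a)):
        e[k + 1 :] -= a[k] * x[: len(x) - k - 1]
    return e

def lp_synthesis(e, a):
    """All-pole synthesis filter: x[n] = e[n] + sum_k a[k] * x[n-1-k]."""
    order = len(a)
    x = np.zeros_like(e)
    for n in range(len(e)):
        past = x[max(0, n - order) : n][::-1]  # x[n-1], x[n-2], ...
        x[n] = e[n] + np.dot(a[: len(past)], past)
    return x

def delay_embed(e, dim):
    """Rows are [e[n-1], ..., e[n-dim]]; the target for each row is e[n]."""
    X = np.column_stack([e[dim - 1 - k : len(e) - 1 - k] for k in range(dim)])
    return X, e[dim:]

def rbf_design(X, centers, width):
    """Gaussian basis activations, one column per center."""
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-d2 / (2.0 * width**2))

def fit_rbf(X, y, centers, width, lam):
    """Ridge-regularized least squares for the RBF output weights (an assumed regularizer)."""
    Phi = rbf_design(X, centers, width)
    A = Phi.T @ Phi + lam * np.eye(Phi.shape[1])
    return np.linalg.solve(A, Phi.T @ y)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n = np.arange(800)
    # Synthetic stand-in for a sustained vowel: three harmonics of 120 Hz at 16 kHz plus noise.
    x = sum(np.sin(2 * np.pi * 120 * h * n / 16000) / h for h in (1, 2, 3))
    x = x + 0.01 * rng.standard_normal(len(n))

    a = lpc_coefficients(x, order=10)
    e = lp_residual(x, a)
    X, y = delay_embed(e, dim=4)
    centers = X[rng.choice(len(X), size=40, replace=False)]  # centers drawn from the data
    w = fit_rbf(X, y, centers, width=e.std(), lam=1e-3)
    pred = rbf_design(X, centers, e.std()) @ w
    print("residual one-step MSE:", np.mean((y - pred) ** 2))
    print("max LP reconstruction error:", np.max(np.abs(lp_synthesis(e, a) - x)))
```

In this sketch the RBF network only performs one-step prediction of the residual. A synthesizer in the spirit of the abstract would presumably run the trained model to generate a residual and pass it through `lp_synthesis`, which is where modifications such as pitch changes could be applied.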
Rank, E., & Kubin, G. (2001). Nonlinear synthesis of vowels in the LP residual domain with a regularized RBF network. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2085 LNCS, pp. 746–753). Springer Verlag. https://doi.org/10.1007/3-540-45723-2_90