This paper presents a formant frequency tracking algorithm for continuous speech processing. The first stage uses spectral information to generate frequency candidates. Two methods were tested for this purpose: the roots of the Linear Predictive Coding (LPC) polynomial and peak picking on the Chirp Group Delay (CGD) function. The second stage is a beam-search algorithm that looks for the best sequence of formants given the frequency candidates, applying a cost function based on local and global evidence. The main advantage of this beam-search algorithm over previous dynamic programming approaches is that a trajectory function spanning several frames can be optimally incorporated into the cost function. Performance was evaluated using a labeled formant database and the Wavesurfer formant tracker, achieving promising results. © 2012 Springer-Verlag.
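The second stage can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, the local cost (relative distance to assumed nominal formant frequencies), the trajectory term (squared deviation from the mean of the last few frames of a hypothesis), and all parameter values are assumptions chosen only to show how a multi-frame term can sit inside a beam-search cost.

```python
from itertools import combinations


def local_cost(formants, nominal):
    # Hypothetical local evidence: relative distance to nominal formant values.
    return sum(abs(f - n) / n for f, n in zip(formants, nominal))


def trajectory_cost(history, formants, window=3):
    # Hypothetical trajectory term: squared deviation (in kHz) of each formant
    # from its mean over the last `window` frames of this hypothesis.
    recent = history[-window:]
    if not recent:
        return 0.0
    cost = 0.0
    for k, f in enumerate(formants):
        mean_k = sum(frame[k] for frame in recent) / len(recent)
        cost += ((f - mean_k) / 1000.0) ** 2
    return cost


def beam_search_formants(candidates, nominal, beam_width=5, window=3, weight=1.0):
    """Pick one ascending tuple of formants per frame from candidate frequencies.

    candidates: list of frames, each a list of candidate frequencies in Hz.
    nominal:    assumed nominal formant frequencies, one per tracked formant.
    """
    n = len(nominal)
    beam = [(0.0, [])]  # each hypothesis is (accumulated cost, path so far)
    for frame in candidates:
        # Every ascending assignment of n candidates to the n formants.
        options = list(combinations(sorted(frame), n))
        if not options:
            raise ValueError("frame has fewer candidates than formants")
        expanded = []
        for cost, path in beam:
            for opt in options:
                step = local_cost(opt, nominal) + weight * trajectory_cost(path, opt, window)
                expanded.append((cost + step, path + [opt]))
        # Prune: keep only the `beam_width` cheapest hypotheses.
        expanded.sort(key=lambda h: h[0])
        beam = expanded[:beam_width]
    return beam[0][1]


frames = [[500, 1500, 2500], [510, 900, 1490], [505, 1510, 3000]]
print(beam_search_formants(frames, nominal=(500, 1500)))
```

Because each hypothesis carries its own path, the trajectory term can look back over several frames without the restriction to adjacent-frame transition costs that a standard dynamic programming recursion imposes. In this sketch the candidate lists are given directly. In the paper they come from LPC roots or CGD peaks.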
CITATION STYLE
Laínez, J. E. G., González, D. R., Artiaga, A. M., Solano, E. L., & De Lara, J. R. C. (2012). Beam-search formant tracking algorithm based on trajectory functions for continuous speech. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7441 LNCS, pp. 749–756). https://doi.org/10.1007/978-3-642-33275-3_92