A clustering algorithm for the fast match of acoustic conditions in continuous speech recognition

0Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In practical speech recognition applications, channel/environment conditions may not match those of the corpus used to estimate the acoustic models. A straightforward methodology is proposed in this paper by which the speech recognizer can match the acoustic conditions of input utterances, thus allowing instantaneous adaptation schemes. First a number of clusters is determined in the training material in a fully unsupervised way, using a dissimilarity measure based on shallow acoustic models. Then accurate acoustic models are estimated for each cluster, and finally a fast match strategy, based on the shallow models, is used to choose the most likely acoustic condition for each input utterance. The performance of the clustering algorithm was tested on two speech databases in Spanish: SENGLAR (read speech) and CORLEC-EHU-1 (spontaneous human-human dialogues). In both cases, speech utterances were consistently grouped by gender, by recording conditions or by background/ channel noise. Furthermore, the fast match methodology led to noticeable improvements in preliminary phonetic recognition experiments, at 20-50% of the computational cost of the ML match. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Rodríguez, L. J., & Torres, M. I. (2005). A clustering algorithm for the fast match of acoustic conditions in continuous speech recognition. In Lecture Notes in Computer Science (Vol. 3523, pp. 562–570). Springer Verlag. https://doi.org/10.1007/11492542_69

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free