Improving speech recognition through automatic selection of age group–specific acoustic models

Abstract

The acoustic models used by automatic speech recognisers are usually trained with speech collected from young to middle-aged adults. As the characteristics of speech change with age, such acoustic models tend to perform poorly on children’s and elderly people’s speech. In this study, we investigate whether automatic age group classification of speakers, combined with age group–specific acoustic models, could improve automatic speech recognition performance. We train an age group classifier with an accuracy of about 95% and show that using its output to select age group–specific acoustic models for children and the elderly leads to considerable gains in automatic speech recognition performance, compared with recognising their speech with acoustic models trained on young to middle-aged adults’ speech.
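
The selection step described in the abstract can be pictured as a small dispatch routine. The Python sketch below is illustrative only and is not the authors' implementation. The age group labels (`child`, `adult`, `elderly`), the function names, and the `classify_age_group` and `decode` callables are all hypothetical placeholders for an age group classifier and a speech decoder.

```python
from typing import Any, Callable, Dict


def select_acoustic_model(age_group: str, models: Dict[str, Any],
                          fallback: str = "adult") -> Any:
    """Return the acoustic model for the predicted age group.

    Falls back to the (assumed) adult model when the predicted label has
    no dedicated model. The fallback key must be present in `models`.
    """
    return models.get(age_group, models[fallback])


def recognise(audio: Any,
              classify_age_group: Callable[[Any], str],
              models: Dict[str, Any],
              decode: Callable[[Any, Any], Any]) -> Any:
    """Classify the speaker's age group, then decode with the matching model."""
    age_group = classify_age_group(audio)             # hypothetical classifier
    model = select_acoustic_model(age_group, models)  # pick the matching model
    return decode(audio, model)                       # hypothetical decoder
```

Here `models` would map each age group label to a separately trained acoustic model. A misclassified speaker is simply decoded with the wrong model, which is why the classifier's accuracy matters for the overall gain.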

Cite

Hämäläinen, A., Meinedo, H., Tjalve, M., Pellegrini, T., Trancoso, I., & Dias, M. S. (2014). Improving speech recognition through automatic selection of age group–specific acoustic models. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8775, pp. 12–23). Springer Verlag. https://doi.org/10.1007/978-3-319-09761-9
