Deep neural network adaptation for children's and adults' speech recognition

  • Serizel, Romain
  • Giuliani, Diego
N/ACitations
Citations of this article
16Readers
Mendeley users who have this article in their library.

Abstract

English. This paper introduces a novel application of the hybrid deep neural network (DNN)-hidden Markov model (HMM) approach for automatic speech recognition (ASR) to target groups of speakers of a specific age/gender. The group-specific training of DNN is investigated and shown to be inefficient when the amount of training data is limited. To overcome this problem, the recent approach that consists in adapting a general DNN to domain/language specific data is extended to target age/gender groups in the context of hybrid DNN-HMM systems, reducing consistently the phone error rate by 15-20% relative for the three different speaker groups.

Cite

CITATION STYLE

APA

Serizel, Romain, & Giuliani, Diego. (2022). Deep neural network adaptation for children’s and adults’ speech recognition. In Proceedings of the First Italian Conference on Computational Linguistics CLiC-it 2014 and of the Fourth International Workshop EVALITA 2014 9-11 December 2014, Pisa. pisa university press. https://doi.org/10.12871/clicit2014166

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free