Convolutional neural network for refinement of speaker adaptation transformation

Zbynĕk Zajíc; Jan Zelinka; Jan Vanĕk; Ludĕk Müller

Conference Proceedings

Convolutional neural network for refinement of speaker adaptation transformation

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2014) 8773 161-168

DOI: 10.1007/978-3-319-11581-8_20

1Citations

5Readers

Get full text

Abstract

The aim of this work is to propose a refinement of the shift-MLLR (shift Maximum Likelihood Linear Regression) adaptation of an acoustics model in the case of limited amount of adaptation data, which can lead to ill-conditioned transformations matrices. We try to suppress the influence of badly estimated transformation parameters utilizing the Artificial Neural Network (ANN), especially Convolutional Neural Network (CNN) with bottleneck layer on the end. The badly estimated shift-MLLR transformation is propagated through an ANN (suitably trained beforehand), and the output of the net is used as the new refined transformation. To train the ANN the well and the badly conditioned shift-MLLR transformations are used as outputs and inputs of ANN, respectively.

Author supplied keywords

Cite

CITATION STYLE

APA

Zajíc, Z., Zelinka, J., Vanĕk, J., & Müller, L. (2014). Convolutional neural network for refinement of speaker adaptation transformation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8773, pp. 161–168). Springer Verlag. https://doi.org/10.1007/978-3-319-11581-8_20

Convolutional neural network for refinement of speaker adaptation transformation

Abstract

Author supplied keywords

Cite

Register to see more suggestions