Posterior estimates and transforms for speech recognition

Jan Zelinka; Luboš Šmídl; Jan Trmal; Luděk Müller

Conference Proceedings

Posterior estimates and transforms for speech recognition

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2010) 6231 LNAI 480-487

DOI: 10.1007/978-3-642-15760-8_61

0Citations

4Readers

Get full text

Abstract

This paper describes ANN based posterior estimates and their application to speech recognition. We replaced the standard back-propagation with the L-BFGS quasi-Newton method. We have focused only on posterior based feature vector extraction. Our goal was a feature vector dimension reduction. Thus we designed three posterior transforms to space with dimensionality 1 or 2. The designed transforms were tested on the SpeechDat-East corpus. We also applied the introduced method on a Czech audio-visual corpus. In both cases the methods leads to significant word error rate decrease. © 2010 Springer-Verlag Berlin Heidelberg.

Author supplied keywords

Cite

CITATION STYLE

APA

Zelinka, J., Šmídl, L., Trmal, J., & Müller, L. (2010). Posterior estimates and transforms for speech recognition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6231 LNAI, pp. 480–487). https://doi.org/10.1007/978-3-642-15760-8_61

Posterior estimates and transforms for speech recognition

Abstract

Author supplied keywords

Cite

Register to see more suggestions