This paper describes ANN based posterior estimates and their application to speech recognition. We replaced the standard back-propagation with the L-BFGS quasi-Newton method. We have focused only on posterior based feature vector extraction. Our goal was a feature vector dimension reduction. Thus we designed three posterior transforms to space with dimensionality 1 or 2. The designed transforms were tested on the SpeechDat-East corpus. We also applied the introduced method on a Czech audio-visual corpus. In both cases the methods leads to significant word error rate decrease. © 2010 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Zelinka, J., Šmídl, L., Trmal, J., & Müller, L. (2010). Posterior estimates and transforms for speech recognition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6231 LNAI, pp. 480–487). https://doi.org/10.1007/978-3-642-15760-8_61
Mendeley helps you to discover research relevant for your work.