Multi-candidate missing data imputation for robust speech recognition

Yujun Wang; Hugo Van Hamme

Journal ArticleOPEN ACCESS

Multi-candidate missing data imputation for robust speech recognition

Eurasip Journal on Audio, Speech, and Music Processing (2012) 2012(1)

DOI: 10.1186/1687-4722-2012-17

2Citations

6Readers

Abstract

The application of Missing Data Techniques (MDT) to increase the noise robustness of HMM/GMM-based large vocabulary speech recognizers is hampered by a large computational burden. The likelihood evaluations imply solving many constrained least squares (CLSQ) optimization problems. As an alternative, researchers have proposed frontend MDT or have made oversimplifying independence assumptions for the backend acoustic model. In this article, we propose a fast Multi-Candidate (MC) approach that solves the per-Gaussian CLSQ problems approximately by selecting the best from a small set of candidate solutions, which are generated as the MDT solutions on a reduced set of cluster Gaussians. Experiments show that the MC MDT runs equally fast as the uncompensated recognizer while achieving the accuracy of the full backend optimization approach. The experiments also show that exploiting the more accurate acoustic model of the backend does pay off in terms of accuracy when compared to frontend MDT. © 2012 Wang and Van hamme; licensee Springer.

Author supplied keywords

Cite

CITATION STYLE

APA

Wang, Y., & Van Hamme, H. (2012). Multi-candidate missing data imputation for robust speech recognition. Eurasip Journal on Audio, Speech, and Music Processing, 2012(1). https://doi.org/10.1186/1687-4722-2012-17

Multi-candidate missing data imputation for robust speech recognition

Abstract

Author supplied keywords

Cite

Register to see more suggestions