We present single-channel approaches to robust automatic speech recognition (ASR) in reverberant environments based on non-intrusive estimation of the clarity index (C 50). Our best performing method includes the estimated value of C 50 in the ASR feature vector and also uses C 50 to select the most suitable ASR acoustic model according to the reverberation level. We evaluate our method on the REVERB Challenge database employing two different C 50 estimators and show that our method outperforms the best baseline of the challenge achieved without unsupervised acoustic model adaptation, i.e. using multi-condition hidden Markov models (HMMs). Our approach achieves a 22.4 % relative word error rate reduction in comparison to the best baseline of the challenge.
CITATION STYLE
Parada, P. P., Sharma, D., Naylor, P. A., & Waterschoot, T. van. (2015). Reverberant speech recognition exploiting clarity index estimation. Eurasip Journal on Advances in Signal Processing, 2015(1). https://doi.org/10.1186/s13634-015-0237-7
Mendeley helps you to discover research relevant for your work.