Reverberant speech recognition exploiting clarity index estimation

Pablo Peso Parada; Dushyant Sharma; Patrick A. Naylor; Toon van Waterschoot

Journal ArticleOPEN ACCESS

Reverberant speech recognition exploiting clarity index estimation

Eurasip Journal on Advances in Signal Processing (2015) 2015(1)

DOI: 10.1186/s13634-015-0237-7

11Citations

20Readers

Abstract

We present single-channel approaches to robust automatic speech recognition (ASR) in reverberant environments based on non-intrusive estimation of the clarity index (C 50). Our best performing method includes the estimated value of C 50 in the ASR feature vector and also uses C 50 to select the most suitable ASR acoustic model according to the reverberation level. We evaluate our method on the REVERB Challenge database employing two different C 50 estimators and show that our method outperforms the best baseline of the challenge achieved without unsupervised acoustic model adaptation, i.e. using multi-condition hidden Markov models (HMMs). Our approach achieves a 22.4 % relative word error rate reduction in comparison to the best baseline of the challenge.

Author supplied keywords

Cite

CITATION STYLE

APA

Parada, P. P., Sharma, D., Naylor, P. A., & Waterschoot, T. van. (2015). Reverberant speech recognition exploiting clarity index estimation. Eurasip Journal on Advances in Signal Processing, 2015(1). https://doi.org/10.1186/s13634-015-0237-7

Reverberant speech recognition exploiting clarity index estimation

Abstract

Author supplied keywords

Cite

Register to see more suggestions