Feature denoising using joint sparse representation for in-car speech recognition

Weifeng Li; Yicong Zhou; Norman Poh; Fei Zhou; Qingmin Liao

Journal Article

Feature denoising using joint sparse representation for in-car speech recognition

IEEE Signal Processing Letters (2013) 20(7) 681-684

DOI: 10.1109/LSP.2013.2245894

38Citations

21Readers

Get full text

Abstract

We address reducing the mismatch between training and testing conditions for hands-free in-car speech recognition. It is well known that the distortions caused by background noise, channel effects, etc., are highly nonlinear in the log-spectral or cepstral domain. This letter introduces a joint sparse representation (JSR) to estimate the underlying clean feature vector from a noisy feature vector. Performing a joint dictionary learning by sharing the same representation coefficients, the proposed method intends to capture the complex relationships (or mapping functions) between clean and noisy speech. Speech recognition experiments on realistic in-car data demonstrate that the proposed method shows excellent recognition performance with a relative improvement of 39.4% compared with the 'baseline' frontends. © 2012 IEEE.

Author supplied keywords

Cite

CITATION STYLE

APA

Li, W., Zhou, Y., Poh, N., Zhou, F., & Liao, Q. (2013). Feature denoising using joint sparse representation for in-car speech recognition. IEEE Signal Processing Letters, 20(7), 681–684. https://doi.org/10.1109/LSP.2013.2245894

Feature denoising using joint sparse representation for in-car speech recognition

Abstract

Author supplied keywords

Cite

Register to see more suggestions