The robustness of speech representations obtained from simulated auditory nerve fibers under different noise conditions

Tim Jürgens; Thomas Brand; Nicholas R. Clark; Ray Meddis; Guy J. Brown

Journal ArticleOPEN ACCESS

The robustness of speech representations obtained from simulated auditory nerve fibers under different noise conditions

Jürgens T
Brand T
Clark N
et al.

The Journal of the Acoustical Society of America (2013) 134(3) EL282-EL288

DOI: 10.1121/1.4817912

4Citations

28Readers

Abstract

Different methods of extracting speech features from an auditory model were systematically investigated in terms of their robustness to different noises. The methods either computed the average firing rate within frequency channels (spectral features) or inter-spike-intervals (timing features) from the simulated auditory nerve response. When used as the front-end for an automatic speech recognizer, timing features outperformed spectral features in Gaussian noise. However, this advantage was lost in babble, because timing features extracted the spectro-temporal structure of babble noise, which is similar to the target speaker. This suggests that different feature extraction methods are optimal depending on the background noise.

Cite

CITATION STYLE

APA

Jürgens, T., Brand, T., Clark, N. R., Meddis, R., & Brown, G. J. (2013). The robustness of speech representations obtained from simulated auditory nerve fibers under different noise conditions. The Journal of the Acoustical Society of America, 134(3), EL282–EL288. https://doi.org/10.1121/1.4817912

The robustness of speech representations obtained from simulated auditory nerve fibers under different noise conditions

Abstract

Cite

Register to see more suggestions