Our work presents a novel data driven compensation technique that modifies on-line the incoming spectral representation of degraded speech to approximate the features of high quality speech used to train a classifier. We apply the Bayesian inference framework to the degraded spectral coefficients based on modeling clean speech linear-spectrum with appropriate non-Gaussian distributions that allowmaximum a-posteriori (MAP) closed form solution to be set.MAP solution leads to a soft threshold function applied and adapted to the spectral characteristics and noise variance of each spectral band. We perform extensive evaluation of our algorithm against white and coloured Gaussian noise in the context of Automatic Speech Recognition (ASR), and demonstrate its robustness in adverse conditions. The enhancement process comes at little to no extra computational overhead, thus achieving real time, on line performance.
CITATION STYLE
Potamitis, I., Fakotakis, N., & Kokkinakis, G. (2001). Bayesian noise compensation of time trajectories of spectral coefficients for robust speech recognition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2166, pp. 214–221). Springer Verlag. https://doi.org/10.1007/3-540-44805-5_28
Mendeley helps you to discover research relevant for your work.