We propose a novel feature processing technique that provides a cepstral liftering effect in the log-spectral domain. Cepstral liftering aims to equalize the variance of cepstral coefficients for distance-based speech recognizers and, as a result, provides robustness to additive noise and speaker variability. However, in the popular hidden Markov model based framework, cepstral liftering has no effect on recognition performance. We derive a filtering method in the log-spectral domain that corresponds to cepstral liftering. The proposed method performs high-pass filtering based on the decorrelation of filter-bank energies. We show that in noisy speech recognition, the proposed method reduces the error rate by 52.7% relative to the conventional feature.
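The correspondence between cepstral liftering and an operation on log filter-bank energies can be illustrated with a minimal sketch. It is not the filter derived in the paper, which the abstract does not specify. It assumes an orthonormal DCT-II as the map from log filter-bank energies to cepstra and a sinusoidal lifter of the kind used in HTK-style front ends. The channel count `n = 23`, the lifter parameter `L = 22`, and all function names are hypothetical choices for illustration.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II matrix (n x n): cepstrum = D @ log filter-bank energies."""
    k = np.arange(n)[:, None]   # cepstral (quefrency) index
    m = np.arange(n)[None, :]   # filter-bank channel index
    d = np.sqrt(2.0 / n) * np.cos(np.pi * k * (m + 0.5) / n)
    d[0, :] = np.sqrt(1.0 / n)  # rescale the first row so that D @ D.T == I
    return d

def sinusoidal_lifter(n, L=22):
    """Assumed lifter weights w_k = 1 + (L/2) sin(pi k / L); not taken from the paper."""
    k = np.arange(n)
    return 1.0 + (L / 2.0) * np.sin(np.pi * k / L)

def log_spectral_equivalent(n, weights):
    """Matrix F = D^T diag(w) D acting on log filter-bank energies.

    Applying F before the DCT yields the same cepstrum as liftering after it.
    """
    d = dct_matrix(n)
    return d.T @ np.diag(weights) @ d

n = 23                                  # hypothetical number of filter-bank channels
rng = np.random.default_rng(0)
log_fbe = rng.normal(size=n)            # stand-in for one frame of log energies

d = dct_matrix(n)
w = sinusoidal_lifter(n)
liftered_cepstrum = w * (d @ log_fbe)   # liftering in the cepstral domain

f = log_spectral_equivalent(n, w)
filtered_log_fbe = f @ log_fbe          # same effect, applied in the log-spectral domain

assert np.allclose(d @ filtered_log_fbe, liftered_cepstrum)
```

Because the DCT matrix is orthonormal, any diagonal weighting of the cepstrum maps to a symmetric linear operator across filter-bank channels. Under these assumptions the operator is a full matrix rather than a short filter, so the sketch shows only that such an equivalent exists, not the specific high-pass form the paper proposes.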
Jung, H. Y. (2004). Filtering of filter-bank energies for robust speech recognition. ETRI Journal, 26(3), 273–276. https://doi.org/10.4218/etrij.04.0203.0033