SIGNAL PROCESSING FOR ROBUST SPEECH RECOGNITION

Richard M. Stern; Fu Hua Liu; Pedro J. Moreno; Alejandro Acero

Conference ProceedingsOPEN ACCESS

SIGNAL PROCESSING FOR ROBUST SPEECH RECOGNITION

3rd International Conference on Spoken Language Processing, ICSLP 1994 (1994) 1027-1030

DOI: 10.3115/1075812.1075889

5Citations

77Readers

Abstract

This paper describes several new cepstral-based compensation procedures that render the SPHINX-II system more robust with respect to acoustical environment. The first algorithm, phone-dependent cepstral compensation, is similar in concept to the previously-described MFCDCN method, except that cepstral compensation vectors are selected according to the current phonetic hypothesis, rather than on the basis of SNR or VQ codeword identity. We also describe two procedures to accomplish adaptation of the VQ codebook for new environments. Use of the various compensation algorithms in consort produces a reduction of error rates for SPHINX-II by as much as 40 percent relative to the rate achieved with cepstral mean normalization alone.

Cite

CITATION STYLE

APA

Stern, R. M., Liu, F. H., Moreno, P. J., & Acero, A. (1994). SIGNAL PROCESSING FOR ROBUST SPEECH RECOGNITION. In 3rd International Conference on Spoken Language Processing, ICSLP 1994 (pp. 1027–1030). The International Society for Computers and Their Applications (ISCA). https://doi.org/10.3115/1075812.1075889

SIGNAL PROCESSING FOR ROBUST SPEECH RECOGNITION

Abstract

Cite

Register to see more suggestions