Third-order moments of filtered speech signals for robust speech recognition

2Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Novel speech features calculated from third-order statistics of subband-filtered speech signals are introduced and studied for robust speech recognition. These features have the potential to capture nonlinear information not represented by cepstral coefficients. Also, because the features presented in this paper are based on the third-order moments, they may be more immune to Gaussian noise than cepstrals, as Gaussian distributions have zero third-order moments. Experiments on the AURORA2 database studying these features in combination with Mel-frequency cepstral coefficients (MFCC's) are presented, and some improvement over the MFCC-only baseline is shown when clean speech is used for training, though the same improvement is not seen when multi-condition training data is used. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Indrebo, K. M., Povinelli, R. J., & Johnson, M. T. (2005). Third-order moments of filtered speech signals for robust speech recognition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3817 LNAI, pp. 277–283). https://doi.org/10.1007/11613107_24

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free