Improved emotion recognition with novel task-oriented wavelet packet features

6Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In this paper, a wavelet packet based adaptive filter-bank construction method is proposed for speech signal processing. On this basis, a set of acoustic features are proposed for speech emotion recognition, namely Wavelet Packet Cepstral Coefficients (WPCC). The former extends the conventional Mel-Frequency Cepstral Coefficients (MFCC) by adapting the filter-bank structure according to the decision task; while the later aims at selecting the most crucial frequency bands where the most discriminative emotion information is located. Speech emotion recognition system is constructed with the two proposed feature sets and Gaussian mixture model as classifier. Experimental results on Berlin emotional speech database show that the proposed features improve emotion recognition performance over the conventional MFCC feature. The proposed feature extraction scheme has encouraging prospects since it can be extended to 2D image processing with 2D wavelet packets and hence extended to audio-visual bimodal emotion recognition application. © 2014 Springer International Publishing Switzerland.

Cite

CITATION STYLE

APA

Huang, Y., Zhang, G., Li, Y., & Wu, A. (2014). Improved emotion recognition with novel task-oriented wavelet packet features. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8588 LNCS, pp. 706–714). Springer Verlag. https://doi.org/10.1007/978-3-319-09333-8_77

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free