Emotion Recognition from Speech Using Wavelet Packet Transform Cochlear Filter Bank and Random Forest Classifier

Shibani Hamsa; Ismail Shahin; Youssef Iraqi; Naoufel Werghi

Journal ArticleOPEN ACCESS

Emotion Recognition from Speech Using Wavelet Packet Transform Cochlear Filter Bank and Random Forest Classifier

IEEE Access (2020) 8 96994-97006

DOI: 10.1109/ACCESS.2020.2991811

49Citations

64Readers

Abstract

This research aims to design and implement an artificial emotional intelligence system that is capable of identifying the unknown emotion of the speaker. To that end, we propose a novel framework for emotion recognition in the presence of noise and interference. Our approach accounts for energy, time and spectral parameters to examine the emotion of the speaker. However, rather than using Gammatone filterbank and short-time Fourier transform (STFT), commonly adopted in the literature, we propose employing a novel wavelet packet transform (WPT) based cochlear filterbank. Our system, coupling this representation with random forest classifier, shows superior performance over other existing algorithms when appraised on three distinct speech corpora in two different languages, and considering also stressful and noisy talking conditions.

Author supplied keywords

Cite

CITATION STYLE

APA

Hamsa, S., Shahin, I., Iraqi, Y., & Werghi, N. (2020). Emotion Recognition from Speech Using Wavelet Packet Transform Cochlear Filter Bank and Random Forest Classifier. IEEE Access, 8, 96994–97006. https://doi.org/10.1109/ACCESS.2020.2991811

Emotion Recognition from Speech Using Wavelet Packet Transform Cochlear Filter Bank and Random Forest Classifier

Abstract

Author supplied keywords

Cite

Register to see more suggestions