Speech Emotion Recognition Based on Multiple Acoustic Features and Deep Convolutional Neural Network

Kishor Bhangale; Mohanaprasad Kothandaraman

Journal ArticleOPEN ACCESS

Speech Emotion Recognition Based on Multiple Acoustic Features and Deep Convolutional Neural Network

Electronics (Switzerland) (2023) 12(4)

DOI: 10.3390/electronics12040839

40Citations

54Readers

Abstract

Speech emotion recognition (SER) plays a vital role in human–machine interaction. A large number of SER schemes have been anticipated over the last decade. However, the performance of the SER systems is challenging due to the high complexity of the systems, poor feature distinctiveness, and noise. This paper presents the acoustic feature set based on Mel frequency cepstral coefficients (MFCC), linear prediction cepstral coefficients (LPCC), wavelet packet transform (WPT), zero crossing rate (ZCR), spectrum centroid, spectral roll-off, spectral kurtosis, root mean square (RMS), pitch, jitter, and shimmer to improve the feature distinctiveness. Further, a lightweight compact one-dimensional deep convolutional neural network (1-D DCNN) is used to minimize the computational complexity and to represent the long-term dependencies of the speech emotion signal. The overall effectiveness of the proposed SER systems’ performance is evaluated on the Berlin Database of Emotional Speech (EMODB) and the Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS) datasets. The proposed system gives an overall accuracy of 93.31% and 94.18% for the EMODB and RAVDESS datasets, respectively. The proposed MFCC and 1-D DCNN provide greater accuracy and outpace the traditional SER techniques.

Author supplied keywords

Cite

CITATION STYLE

APA

Bhangale, K., & Kothandaraman, M. (2023). Speech Emotion Recognition Based on Multiple Acoustic Features and Deep Convolutional Neural Network. Electronics (Switzerland), 12(4). https://doi.org/10.3390/electronics12040839

Speech Emotion Recognition Based on Multiple Acoustic Features and Deep Convolutional Neural Network

Abstract

Author supplied keywords

Cite

Register to see more suggestions