Arabic Speech Emotion Recognition from Saudi Dialect Corpus

31Citations
Citations of this article
73Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Recognizing emotions from speech using machine learning algorithms has become an active research topic lately as a result of the demand for more human interactive applications. Emotion recognition systems are mostly implemented in German, English, Spanish, Dutch, Danish, and other European and Asian languages due to the availability of datasets for these languages. However, for Arabic, there is an extremely limited number of available speech emotion datasets. Therefore, in this paper studies emotion recognition based on Arabic Saudi dialect spoken data. The dataset was created from freely available YouTube videos and labeled using four perceived emotions: anger, happiness, sadness, and neutral. Various spectral features such as the mel-frequency cepstral coefficient (MFCC) and mel spectrogram, were extracted, and then the classification methods support vector machine (SVM), multi-layer perceptron (MLP), and k-nearest neighbor (KNN) were applied. The results were discussed, analyzed, and compared between the three models using different feature extractions. Experiments showed that SVM obtained the best accuracy result with 77.14%, demonstrating improvement in Arabic speech emotion recognition for this classification method.

Cite

CITATION STYLE

APA

Aljuhani, R. H., Alshutayri, A., & Alahdal, S. (2021). Arabic Speech Emotion Recognition from Saudi Dialect Corpus. IEEE Access, 9, 127081–127085. https://doi.org/10.1109/ACCESS.2021.3110992

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free