Deep learning techniques for speech emotion recognition, from databases to models

Babak Joze Abbaschian; Daniel Sierra-Sosa; Adel Elmaghraby

ArticleOPEN ACCESS

Deep learning techniques for speech emotion recognition, from databases to models

Sensors (Switzerland)

DOI: 10.3390/s21041249

195Citations

371Readers

Abstract

The advancements in neural networks and the on-demand need for accurate and near real-time Speech Emotion Recognition (SER) in human–computer interactions make it mandatory to compare available methods and databases in SER to achieve feasible solutions and a firmer understanding of this open-ended problem. The current study reviews deep learning approaches for SER with available datasets, followed by conventional machine learning techniques for speech emotion recognition. Ultimately, we present a multi-aspect comparison between practical neural network approaches in speech emotion recognition. The goal of this study is to provide a survey of the field of discrete speech emotion recognition.

Author supplied keywords

Cite

CITATION STYLE

APA

Abbaschian, B. J., Sierra-Sosa, D., & Elmaghraby, A. (2021, February 2). Deep learning techniques for speech emotion recognition, from databases to models. Sensors (Switzerland). MDPI AG. https://doi.org/10.3390/s21041249

Deep learning techniques for speech emotion recognition, from databases to models

Abstract

Author supplied keywords

Cite

Register to see more suggestions