Deep learning techniques for speech emotion recognition, from databases to models

Abstract

Advances in neural networks and the growing demand for accurate, near real-time Speech Emotion Recognition (SER) in human–computer interaction make it necessary to compare the available SER methods and databases, both to reach feasible solutions and to gain a firmer understanding of this open-ended problem. This study reviews deep learning approaches to SER together with the available datasets, followed by conventional machine learning techniques for SER. Finally, we present a multi-aspect comparison of practical neural network approaches to SER. The goal of this study is to provide a survey of the field of discrete speech emotion recognition.

Cite

Abbaschian, B. J., Sierra-Sosa, D., & Elmaghraby, A. (2021, February 2). Deep learning techniques for speech emotion recognition, from databases to models. Sensors (Switzerland). MDPI AG. https://doi.org/10.3390/s21041249
