Benchmarking classification models for emotion recognition in natural speech: A multi-corporal study

Alexey Tarasov; Sarah Jane Delany

Conference Proceedings

Benchmarking classification models for emotion recognition in natural speech: A multi-corporal study

2011 IEEE International Conference on Automatic Face and Gesture Recognition and Workshops, FG 2011 (2011) 841-845

DOI: 10.1109/FG.2011.5771359

10Citations

29Readers

Get full text

Abstract

A significant amount of the research on automatic emotion recognition from speech focuses on acted speech that is produced by professional actors. This approach often leads to overoptimistic results as the recognition of emotion in real-life conditions is more challenging due the propensity of mixed and less intense emotions in natural speech. The paper presents an empirical study of the most widely used classifiers in the domain of emotion recognition from speech, across multiple non-acted emotional speech corpora. The results indicate that Support Vector Machines have the best performance and that they along with Multi-Layer Perceptron networks and k-nearest neighbour classifiers perform significantly better (using the appropriate statistical tests) than decision trees, Nave Bayes classifiers and Radial Basis Function networks. © 2011 IEEE.

Cite

CITATION STYLE

APA

Tarasov, A., & Delany, S. J. (2011). Benchmarking classification models for emotion recognition in natural speech: A multi-corporal study. In 2011 IEEE International Conference on Automatic Face and Gesture Recognition and Workshops, FG 2011 (pp. 841–845). https://doi.org/10.1109/FG.2011.5771359

Benchmarking classification models for emotion recognition in natural speech: A multi-corporal study

Abstract

Cite

Register to see more suggestions