Speech Emotion Recognition Based on Deep Learning and Fuzzy Optimization

1Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The automatic speech emotion recognition (SER) is a challenging and attractive task. To gain a high accuracy in SER, it is need to extract salient features from the speech. Therefore, in this paper, we first verify whether the deep convolutional neural network (DCNN) can improve the salient of hand-crafted acoustic features, and then based on the conclusion of the first step, considering the fuzziness of emotion, we propose a theoretical approach of fuzzy neural network (FNN) combined with deep learning to deal with the fuzziness of emotion. The first step work is tested on the CASIA Chinese Emotional Corpus, and the second part is introduced in theory and analyzed about its feasibility.

Cite

CITATION STYLE

APA

Xu, J. cheng, & Xiao, N. feng. (2018). Speech Emotion Recognition Based on Deep Learning and Fuzzy Optimization. In Advances in Intelligent Systems and Computing (Vol. 690, pp. 377–383). Springer Verlag. https://doi.org/10.1007/978-3-319-65978-7_58

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free