The automatic speech emotion recognition (SER) is a challenging and attractive task. To gain a high accuracy in SER, it is need to extract salient features from the speech. Therefore, in this paper, we first verify whether the deep convolutional neural network (DCNN) can improve the salient of hand-crafted acoustic features, and then based on the conclusion of the first step, considering the fuzziness of emotion, we propose a theoretical approach of fuzzy neural network (FNN) combined with deep learning to deal with the fuzziness of emotion. The first step work is tested on the CASIA Chinese Emotional Corpus, and the second part is introduced in theory and analyzed about its feasibility.
CITATION STYLE
Xu, J. cheng, & Xiao, N. feng. (2018). Speech Emotion Recognition Based on Deep Learning and Fuzzy Optimization. In Advances in Intelligent Systems and Computing (Vol. 690, pp. 377–383). Springer Verlag. https://doi.org/10.1007/978-3-319-65978-7_58
Mendeley helps you to discover research relevant for your work.