Attention-based dense LSTM for speech emotion recognition

Yue Xie; Ruiyu Liang; Zhenlin Liang; Li Zhao

Journal Article, Open Access

IEICE Transactions on Information and Systems (2019) E102D(7) 1426-1429

DOI: 10.1587/transinf.2019EDL8019

45 Citations

39 Readers

Abstract

Although deep learning is widely used for speech emotion recognition, deep neural networks are severely limited by information loss in their higher layers and by the degradation problem. To use information efficiently and address degradation, an attention-based dense long short-term memory (LSTM) network is proposed for speech emotion recognition. LSTM networks, which can process time series such as speech, are constructed, and attention-based dense connections are introduced into them. That is, weight coefficients are added to the skip connections of each layer to distinguish the emotional information carried by different layers and to keep redundant information from the bottom layers from interfering with the effective information from the top layers. Experiments demonstrate that the proposed method improves recognition performance by 12% and 7% on the eNTERFACE and IEMOCAP corpora, respectively.
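The idea of weighting the skip connections can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the weight coefficients are computed or how the skip connections are merged, so the softmax normalization, the weighted sum (rather than concatenation), and all names here (`softmax`, `weighted_dense_input`, `dense_stack`) are assumptions made for illustration. The layers are plain functions standing in for LSTM layers, and the vectors represent features at a single time step.

```python
import math


def softmax(scores):
    """Normalize raw scores into positive weights that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def weighted_dense_input(prev_outputs, scores):
    """Merge the outputs of all earlier layers into one input vector.

    prev_outputs: equal-length vectors, one per earlier layer.
    scores: one raw scalar per skip connection (assumed learnable in practice).
    """
    if len(prev_outputs) != len(scores):
        raise ValueError("need exactly one score per skip connection")
    weights = softmax(scores)
    dim = len(prev_outputs[0])
    # Weighted sum across layers, computed feature by feature.
    return [
        sum(w * out[i] for w, out in zip(weights, prev_outputs))
        for i in range(dim)
    ]


def dense_stack(x, layers, layer_scores):
    """Run a densely connected stack with weighted skip connections.

    layers: callables standing in for LSTM layers (hypothetical stubs).
    layer_scores: layer_scores[k] holds k + 1 scores, one for the network
    input and one for each of the k layers below layer k.
    """
    outputs = [x]
    for layer, scores in zip(layers, layer_scores):
        combined = weighted_dense_input(outputs, scores)
        outputs.append(layer(combined))
    return outputs[-1]


if __name__ == "__main__":
    # Stand-in for an LSTM layer: an element-wise tanh.
    squash = lambda v: [math.tanh(a) for a in v]
    # The second layer favors the first layer's output over the raw input.
    print(dense_stack([0.5, -0.2], [squash, squash], [[0.0], [-1.0, 1.0]]))
```

With equal scores every skip connection contributes equally. Raising one score shifts weight toward that layer, which is one way a network could down-weight redundant lower-layer information.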

Cite

Xie, Y., Liang, R., Liang, Z., & Zhao, L. (2019). Attention-based dense LSTM for speech emotion recognition. IEICE Transactions on Information and Systems, E102D(7), 1426–1429. https://doi.org/10.1587/transinf.2019EDL8019
