Convolutional Auto-Encoder and Adversarial Domain Adaptation for Cross-Corpus Speech Emotion Recognition

Yang Wang; Hongliang Fu; Huawei Tao; Jing Yang; Hongyi Ge; Yue Xie

Journal ArticleOPEN ACCESS

Convolutional Auto-Encoder and Adversarial Domain Adaptation for Cross-Corpus Speech Emotion Recognition

IEICE Transactions on Information and Systems (2022) E105D(10) 1803-1806

DOI: 10.1587/transinf.2022EDL8045

2Citations

8Readers

Abstract

This letter focuses on the cross-corpus speech emotion recognition (SER) task, in which the training and testing speech signals in cross-corpus SER belong to different speech corpora. Existing algorithms are incapable of effectively extracting common sentiment information between different corpora to facilitate knowledge transfer. To address this challenging problem, a novel convolutional auto-encoder and adversarial domain adaptation (CAEADA) framework for cross-corpus SER is proposed. The framework first constructs a one-dimensional convolutional auto-encoder (1D-CAE) for feature processing, which can explore the correlation among adjacent one-dimensional statistic features and the feature representation can be enhanced by the architecture based on encoderdecoder-style. Subsequently the adversarial domain adaptation (ADA) module alleviates the feature distributions discrepancy between the source and target domains by confusing domain discriminator, and specifically employs maximum mean discrepancy (MMD) to better accomplish feature transformation. To evaluate the proposed CAEADA, extensive experiments were conducted on EmoDB, eNTERFACE, and CASIA speech corpora, and the results show that the proposed method outperformed other approaches.

Author supplied keywords

Cite

CITATION STYLE

APA

Wang, Y., Fu, H., Tao, H., Yang, J., Ge, H., & Xie, Y. (2022). Convolutional Auto-Encoder and Adversarial Domain Adaptation for Cross-Corpus Speech Emotion Recognition. IEICE Transactions on Information and Systems, E105D(10), 1803–1806. https://doi.org/10.1587/transinf.2022EDL8045

Convolutional Auto-Encoder and Adversarial Domain Adaptation for Cross-Corpus Speech Emotion Recognition

Abstract

Author supplied keywords

Cite

Register to see more suggestions