Video based emotion recognition using CNN and BRNN

Youyi Cai; Wenming Zheng; Tong Zhang; Qiang Li; Zhen Cui; Jiayin Ye

Conference Proceedings

Video based emotion recognition using CNN and BRNN

Communications in Computer and Information Science (2016) 663 679-691

DOI: 10.1007/978-981-10-3005-5_56

17Citations

16Readers

Get full text

Abstract

Video-based Emotion recognition is a rather challenging computer vision task. It not only needs to model spatial information of each image frame, but also requires considering temporal contextual correlations among sequential frames. For this purpose, we propose a hierarchical deep network architecture to extract high-level spatial-temporal features. In this architecture, two classic deep neural networks, convolutional neutral networks (CNN) and bi-directional recurrent neutral networks (BRNN), are employed to respectively capture facial textural characteristics in spatial domain and dynamic emotion changes in temporal domain. We endeavor to coordinate the two networks by optimizing each of them, so as to boost the performance of the emotion recognition. In the challenging competition, our method achieves a promising performance compared with the baselines.

Author supplied keywords

Cite

CITATION STYLE

APA

Cai, Y., Zheng, W., Zhang, T., Li, Q., Cui, Z., & Ye, J. (2016). Video based emotion recognition using CNN and BRNN. In Communications in Computer and Information Science (Vol. 663, pp. 679–691). Springer Verlag. https://doi.org/10.1007/978-981-10-3005-5_56

Video based emotion recognition using CNN and BRNN

Abstract

Author supplied keywords

Cite

Register to see more suggestions