ROI-Attention vectorized CNN model for static facial expression recognition

37Citations
Citations of this article
50Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

When using neural networks to recognize facial expressions, determining which features help identify different expressions is essential, and there is a massive information transmission loss between layers of network with multiple layers. This paper proposes a robust vectorized convolutional neural network (CNN) model that introduces an attention mechanism for extracting features in the region of interests(ROIs) of the face. The ROIs in the facial image are marked before the image is input into the neural network. In particular, the attention concept is adopted in the first layer of the proposed neural network to perform ROIs-related convolution calculation, and ROIs-related convolution calculation results of the specific fields in the ROIs are increased by extracting more robust features. Next, the idea of features' vectors inspired by CapsNet is used in the following layer of the proposed neural network. Multi-level convolutions are used to extract feature vectors of different ROIs for facial expression, and then the feature vectors are reconstructed by a decoder to reconstruct the image. Comprehensive comparative experiments and cross-database experiments are conducted to verify the validity and robustness of our proposed model. The experimental results also demonstrate that our method is very effective in improving the performance of facial expressions recognition.

Cite

CITATION STYLE

APA

Sun, X., Zheng, S., & Fu, H. (2020). ROI-Attention vectorized CNN model for static facial expression recognition. IEEE Access, 8, 7183–7194. https://doi.org/10.1109/ACCESS.2020.2964298

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free