MR-CapsNet: A Deep Learning Algorithm for Image-Based Head Pose Estimation on CapsNet

Hao Fang; Jun Qing Liu; Kai Xie; Peng Wu; Xin Yu Zhang; Chang Wen; Jian Biao He

Journal ArticleOPEN ACCESS

MR-CapsNet: A Deep Learning Algorithm for Image-Based Head Pose Estimation on CapsNet

IEEE Access (2021) 9 141245-141257

DOI: 10.1109/ACCESS.2021.3119615

6Citations

10Readers

Abstract

Head pose estimation based on a single image is a challenging endeavor because of the complex background conditions and characteristics of the human face. In this report, we propose a Multi stage Regression-Capsule Network (MR-CapsNet) to predict head posture based on a single image input. In the study, we used the residual attention block and squeeze-and-excitation block to extract features in three levels. CapsNet overcomes the shortcomings of the traditional convolutional neural network and implements module aggregation to describe the spatial relationship of features after aggregation, in addition to realizing a compact and robust model using a multi-stage regression scheme. We tested our method on the AFLW2000 and BIWI datasets obtaining mean absolute errors of 4.26% and 3.95%, respectively. In addition, we discuss the accuracy of our method in the case of eye or mouth occlusion. The results of comprehensive experiments reveal that our method can accurately predict head posture.

Author supplied keywords

Cite

CITATION STYLE

APA

Fang, H., Liu, J. Q., Xie, K., Wu, P., Zhang, X. Y., Wen, C., & He, J. B. (2021). MR-CapsNet: A Deep Learning Algorithm for Image-Based Head Pose Estimation on CapsNet. IEEE Access, 9, 141245–141257. https://doi.org/10.1109/ACCESS.2021.3119615

MR-CapsNet: A Deep Learning Algorithm for Image-Based Head Pose Estimation on CapsNet

Abstract

Author supplied keywords

Cite

Register to see more suggestions