MR-CapsNet: A Deep Learning Algorithm for Image-Based Head Pose Estimation on CapsNet

6Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Head pose estimation based on a single image is a challenging endeavor because of the complex background conditions and characteristics of the human face. In this report, we propose a Multi stage Regression-Capsule Network (MR-CapsNet) to predict head posture based on a single image input. In the study, we used the residual attention block and squeeze-and-excitation block to extract features in three levels. CapsNet overcomes the shortcomings of the traditional convolutional neural network and implements module aggregation to describe the spatial relationship of features after aggregation, in addition to realizing a compact and robust model using a multi-stage regression scheme. We tested our method on the AFLW2000 and BIWI datasets obtaining mean absolute errors of 4.26% and 3.95%, respectively. In addition, we discuss the accuracy of our method in the case of eye or mouth occlusion. The results of comprehensive experiments reveal that our method can accurately predict head posture.

Cite

CITATION STYLE

APA

Fang, H., Liu, J. Q., Xie, K., Wu, P., Zhang, X. Y., Wen, C., & He, J. B. (2021). MR-CapsNet: A Deep Learning Algorithm for Image-Based Head Pose Estimation on CapsNet. IEEE Access, 9, 141245–141257. https://doi.org/10.1109/ACCESS.2021.3119615

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free