Deep High-Resolution Network with Double Attention Residual Blocks for Human Pose Estimation

8Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

To improve the accuracy of human pose estimation, a novel method based on the deep high-resolution network (HRNet) and equipped with double attention residual blocks is proposed. Firstly, the channel attention and spatial attention modules are added to the residual block of feature extraction, resulting in the network paying more attention to the target area which needs to be extracted important information and suppressed unimportant information. Moreover, this paper proposes a novel module, Parallel Residual Attention Block (PRAB), which parallels the $3\times 3$ group convolution of ResNeXt to the $3\times 3$ convolution layer in the Bottleneck of ResNet, and then adds channel attention and spatial attention modules to these two branches respectively. In this way, the network can further improve the accuracy of human keypoint detection without significantly increasing the computation overhead. To demonstrate the effectiveness of our method, a series of comparative experiments are conducted on the MPII Human Pose dataset and the COCO2017 keypoint detection dataset. Experimental results illustrate that the attention mechanism is effective to improve the accuracy of human pose estimation and the proposed PRAB obtained the best results 90.5% on MPII which outperforms the existing methods.

Cite

CITATION STYLE

APA

Huo, Z., Jin, H., Qiao, Y., & Luo, F. (2020). Deep High-Resolution Network with Double Attention Residual Blocks for Human Pose Estimation. IEEE Access, 8, 224947–224957. https://doi.org/10.1109/ACCESS.2020.3044885

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free