Neural Network Approach for 2-Dimension Person Pose Estimation with Encoded Mask and Keypoint Detection

5Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

We present an innovative approach for 2D person pose estimation by developing a convolutional neural network for human 2-channel mask prediction and human 2D pose estimation. Conceptually our idea is simple, inspired by prior image segmentation research in general. We establish a perception that explicitly encoded mask data can be served as a critical feature for person pose estimation. We propose a convolution neural network model by combining the image segmentation technique with the bottom-up approach for human pose estimation. We observe that the construction of a two stage-network for training in an end-to-end manner is beneficial to one another: for person mask prediction and 2D person pose estimation. At the pose estimation stage, we detect heat-maps against the person keypoints location from the mask information and their mutual connection relations. They are then used to estimate an ultimate pose in a way to remove the unwanted or occluded keypoints, as those keypoints may propagate across the network and lead to redundant pose estimation. We train and test our system on the MS-COCO dataset, and the experimental results validate the superior efficiency of the proposed methodology.

Cite

CITATION STYLE

APA

Rizwan, T., Cai, Y., Ahsan, M., Sohail, N., Nasr, E. A., & Mahmoud, H. A. (2020). Neural Network Approach for 2-Dimension Person Pose Estimation with Encoded Mask and Keypoint Detection. IEEE Access, 8, 107760–107771. https://doi.org/10.1109/ACCESS.2020.3001473

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free