Learning pose grammar to encode human body configuration for 3D pose estimation

292 citations · 178 Mendeley readers

Abstract

In this paper, we propose a pose grammar to tackle the problem of 3D human pose estimation. Our model takes a 2D pose directly as input and learns a generalized 2D-to-3D mapping function. It consists of a base network, which efficiently captures pose-aligned features, and a hierarchy of bi-directional RNNs (BRNNs) on top that explicitly incorporates knowledge of human body configuration (i.e., kinematics, symmetry, and motor coordination), thereby enforcing high-level constraints over human poses. For learning, we develop a pose sample simulator that augments training samples with virtual camera views, which further improves the generalizability of our model. We validate our method on public 3D human pose benchmarks and propose a new cross-view evaluation protocol to verify the generalization capability of different methods. We empirically observe that most state-of-the-art methods struggle under this setting, whereas our method handles it well.
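To make the described pipeline concrete, here is a minimal sketch in PyTorch of a 2D-to-3D lifting model with a base network followed by chain-wise bi-directional RNNs. It is not the authors' released implementation; the joint count, layer widths, chain indices, and class names (BaseNetwork, ChainBRNN, PoseGrammarSketch) are illustrative assumptions.

```python
# Hypothetical sketch (not the authors' code): a 2D-to-3D lifting network with a
# base feature extractor and bi-directional RNNs over joint chains that encode
# kinematics and left/right symmetry. Sizes and chain indices are assumptions.

import torch
import torch.nn as nn

NUM_JOINTS = 16          # assumed joint count
FEAT_DIM = 1024          # assumed hidden width of the base network

# Example joint chains; the indices are placeholders, not the paper's layout.
KINEMATIC_CHAINS = [
    [0, 1, 2, 3],        # e.g., pelvis -> right hip -> right knee -> right ankle
    [0, 4, 5, 6],        # e.g., pelvis -> left hip -> left knee -> left ankle
    [0, 7, 8, 9, 10],    # e.g., pelvis -> spine -> neck -> head
]

class BaseNetwork(nn.Module):
    """Maps flattened 2D joints to a pose feature and a coarse 3D estimate."""
    def __init__(self):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Linear(NUM_JOINTS * 2, FEAT_DIM), nn.ReLU(),
            nn.Linear(FEAT_DIM, FEAT_DIM), nn.ReLU(),
        )
        self.coarse_head = nn.Linear(FEAT_DIM, NUM_JOINTS * 3)

    def forward(self, pose_2d):                       # (B, NUM_JOINTS*2)
        feat = self.backbone(pose_2d)
        coarse_3d = self.coarse_head(feat).view(-1, NUM_JOINTS, 3)
        return feat, coarse_3d

class ChainBRNN(nn.Module):
    """Bi-directional RNN over one joint chain, refining the coarse 3D joints."""
    def __init__(self):
        super().__init__()
        self.rnn = nn.GRU(3, 64, bidirectional=True, batch_first=True)
        self.out = nn.Linear(128, 3)

    def forward(self, chain_joints):                  # (B, chain_len, 3)
        h, _ = self.rnn(chain_joints)
        return chain_joints + self.out(h)             # residual refinement

class PoseGrammarSketch(nn.Module):
    def __init__(self):
        super().__init__()
        self.base = BaseNetwork()
        self.chain_rnns = nn.ModuleList([ChainBRNN() for _ in KINEMATIC_CHAINS])

    def forward(self, pose_2d):
        _, pose_3d = self.base(pose_2d)
        refined = pose_3d.clone()
        for chain, rnn in zip(KINEMATIC_CHAINS, self.chain_rnns):
            refined[:, chain] = rnn(pose_3d[:, chain])
        return refined

if __name__ == "__main__":
    model = PoseGrammarSketch()
    dummy_2d = torch.randn(4, NUM_JOINTS * 2)         # batch of 4 detected 2D poses
    print(model(dummy_2d).shape)                      # torch.Size([4, 16, 3])
```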

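The abstract also mentions a pose sample simulator that augments training data with virtual camera views. The snippet below is a rough sketch of that idea under simple assumptions (a pinhole camera, rotation about the vertical axis, hand-picked angles and intrinsics); it is not the paper's actual simulator.

```python
# Hypothetical virtual-camera augmentation sketch: rotate a ground-truth 3D pose
# about its vertical axis, place it in front of a simple pinhole camera, and
# project it to synthesize new 2D-3D training pairs. Parameters are assumptions.

import numpy as np

def rotate_y(pose_3d, angle_rad):
    """Rotate a (J, 3) pose about the vertical (y) axis."""
    c, s = np.cos(angle_rad), np.sin(angle_rad)
    rot = np.array([[c, 0.0, s],
                    [0.0, 1.0, 0.0],
                    [-s, 0.0, c]])
    return pose_3d @ rot.T

def project_pinhole(pose_3d, focal=1000.0, center=(500.0, 500.0)):
    """Project a (J, 3) camera-space pose to (J, 2) pixel coordinates."""
    x = focal * pose_3d[:, 0] / pose_3d[:, 2] + center[0]
    y = focal * pose_3d[:, 1] / pose_3d[:, 2] + center[1]
    return np.stack([x, y], axis=1)

def simulate_views(pose_3d, angles_deg=(30, 60, 90, 120), depth=4.5):
    """Return (2D, 3D) pairs rendered from several virtual camera views."""
    root = pose_3d.mean(axis=0)
    centered = pose_3d - root                         # rotate about the body centre
    pairs = []
    for deg in angles_deg:
        rotated = rotate_y(centered, np.deg2rad(deg))
        placed = rotated + np.array([0.0, 0.0, depth])  # place in front of the camera
        pairs.append((project_pinhole(placed), placed))
    return pairs

if __name__ == "__main__":
    pose = np.random.randn(16, 3) * 0.3               # dummy (16, 3) pose
    pairs = simulate_views(pose)
    print(len(pairs), pairs[0][0].shape)              # 4 (16, 2)
```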
Cite

APA

Fang, H. S., Xu, Y., Wang, W., Liu, X., & Zhu, S. C. (2018). Learning pose grammar to encode human body configuration for 3D pose estimation. In 32nd AAAI Conference on Artificial Intelligence, AAAI 2018 (pp. 6821–6828). AAAI Press. https://doi.org/10.1609/aaai.v32i1.12270
