Estimating 3D hand pose from a monocular RGB image is important but challenging. One solution is to train on large-scale RGB hand images with accurate 3D hand keypoint annotations; however, collecting such annotations is prohibitively expensive in practice. Instead, we develop a learning-based approach to synthesize realistic, diverse, and 3D pose-preserving hand images under the guidance of 3D pose information. We propose a 3D-aware multi-modal guided hand generative network (MM-Hand), together with a novel geometry-based curriculum learning strategy. Our extensive experimental results demonstrate that the 3D-annotated images generated by MM-Hand qualitatively and quantitatively outperform existing options. Moreover, the augmented data consistently improve the quantitative performance of state-of-the-art 3D hand pose estimators on two benchmark datasets. The code will be available at https://github.com/ScottHoang/mm-hand.
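The abstract does not spell out the geometry-based curriculum, but one common way to realize such a strategy is to order synthesized samples by the 3D keypoint distance between their target pose and the nearest pose already seen, training on "near" (easy) poses before "far" (hard) ones. A minimal sketch under that assumption (all function names and the distance criterion are illustrative, not the paper's actual method):

```python
import math

def pose_distance(p, q):
    # Mean Euclidean distance between corresponding 3D keypoints.
    return sum(math.dist(a, b) for a, b in zip(p, q)) / len(p)

def curriculum_order(synth_poses, seen_poses):
    # Hypothetical geometry-based curriculum: rank each synthesized
    # pose by its distance to the closest already-seen pose, so that
    # geometrically "easy" samples are presented first.
    def difficulty(pose):
        return min(pose_distance(pose, s) for s in seen_poses)
    return sorted(range(len(synth_poses)),
                  key=lambda i: difficulty(synth_poses[i]))

# Toy example: two 3D keypoints per pose.
seen = [[(0, 0, 0), (1, 0, 0)]]
synth = [
    [(5, 5, 5), (6, 5, 5)],      # far from any seen pose -> hard
    [(0, 0, 0.1), (1, 0, 0.1)],  # near a seen pose -> easy
]
order = curriculum_order(synth, seen)  # easy sample (index 1) comes first
```

The same ranking could drive a staged schedule, where later training epochs unlock progressively harder synthesized poses.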
Wu, Z., Hoang, D., Lin, S. Y., Xie, Y., Chen, L., Lin, Y. Y., … Fan, W. (2020). MM-Hand: 3D-Aware Multi-Modal Guided Hand Generation for 3D Hand Pose Synthesis. In MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia (pp. 2508–2516). Association for Computing Machinery, Inc. https://doi.org/10.1145/3394171.3413555