SimPose: Effectively Learning DensePose and Surface Normals of People from Simulated Data

Abstract

Amid a proliferation of generic domain-adaptation approaches, we report a simple yet effective technique for learning difficult per-pixel 2.5D and 3D regression representations of articulated people. We obtain strong sim-to-real domain generalization for the 2.5D DensePose estimation task and the 3D human surface normal estimation task. On the multi-person DensePose MSCOCO benchmark, our approach outperforms state-of-the-art methods that are trained on densely labelled real images. This is an important result because obtaining the human manifold’s intrinsic uv coordinates on real images is time-consuming and prone to labeling noise. Additionally, we present our model’s 3D surface normal predictions on the MSCOCO dataset, which lacks any real 3D surface normal labels. The key to our approach is to mitigate the “Inter-domain Covariate Shift” with a carefully selected training batch from a mixture of domain samples, a deep batch-normalized residual network, and a modified multi-task learning objective. Our approach is complementary to existing domain-adaptation techniques and can be applied to other dense per-pixel pose estimation problems.

Cite

Zhu, T., Karlsson, P., & Bregler, C. (2020). SimPose: Effectively Learning DensePose and Surface Normals of People from Simulated Data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12374 LNCS, pp. 225–242). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-58526-6_14