Light and Fast Hand Pose Estimation from Spatial-Decomposed Latent Heatmap

6Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

We present a light and efficient approach named Latent Fusion network for fast and accurate hand pose estimation from a single depth image. Our method innovatively decomposes 3D joint regression into 2D plane localization and 1D axis estimation from different spatial perspectives. We design multiple latent heatmap regression branches to predict hand pose separately and a fusion network to output the final result. Experiments on three public hand pose datasets (ICVL, NYU, MSRA) demonstrate that our system achieves state-of-the-art accuracy. Moreover, our method outperforms all top-ranked approaches by a large margin both in terms of inference speed (nearly a thousand frames per second) and model size (less than 10 MB).

Cite

CITATION STYLE

APA

Liu, S., Wang, G., Xie, P., & Zhang, C. (2020). Light and Fast Hand Pose Estimation from Spatial-Decomposed Latent Heatmap. IEEE Access, 8, 53072–53081. https://doi.org/10.1109/ACCESS.2020.2979507

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free