Hand pose estimation based on 3D residual network with data padding and skeleton steadying

4Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In this paper, we propose a deep-learning-based approach for real-time hand pose estimation from a single depth image by using 3D convolutional neural network which takes a 3D voxelized grid generated by a depth image as input. Most of the previous works for hand pose estimation only take a single 2D depth image as input and estimate coordinates of the key points of a hand with 2D convolutional neural network. The disadvantage of those methods is that 2D depth image can not represent the spatial information of 3D data, while the 3D voxelized grid can represent the point cloud of the surface of the hand in a spatial way. Hence, we design a 3D convolutional neural network which takes a 3D voxelized grid with data padding as input and steadying the hand skeleton with an additional loss function for regression. Experiments show that our approach outperforms previous methods on two public datasets and can run in real time with a single GPU.

Cite

CITATION STYLE

APA

Ting, P. W., Chou, E. T., Tang, Y. H., & Fu, L. C. (2019). Hand pose estimation based on 3D residual network with data padding and skeleton steadying. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11365 LNCS, pp. 293–307). Springer Verlag. https://doi.org/10.1007/978-3-030-20873-8_19

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free