Two-Dimensional Parallel Spatio-Temporal Pyramid Pooling for Hand Gesture Recognition

1Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.

Abstract

Hand Gesture Recognition (HGR) plays a crucial role in user-friendly interactions between humans and computers. In recent years, using the Convolutional Neural Network (CNN) has improved the accuracy of image processing problems. Inspired by the high recognition rate of CNN and its efficiency, we propose a model for hand gesture recognition based on CNN and evaluate the results using images with plain and complex backgrounds. Recognizing different hand signs by Two-Dimensional Parallel Spatio-Temporal Pyramid Pooling (2DPSTPP) features with deep learning methods reduces the size of the map, minimizes training complexity, and by paying attention to more details, improves detection performance. The effectiveness of the proposed method is evaluated using regular cross-validation tests on six datasets, namely American Sign Language (ASL), the NUS hand posture dataset I, the NUS hand posture dataset II, the digits dataset, the hand gesture dataset, and the leap gesture recognition dataset.

Cite

CITATION STYLE

APA

Jafari, F., & Basu, A. (2023). Two-Dimensional Parallel Spatio-Temporal Pyramid Pooling for Hand Gesture Recognition. IEEE Access, 11, 133755–133766. https://doi.org/10.1109/ACCESS.2023.3336591

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free