Gesture recognition algorithm based on multiscale feature fusion in RGB-D images

62Citations
Citations of this article
17Readers
Mendeley users who have this article in their library.

Abstract

With the rapid development of sensor technology and artificial intelligence, the video gesture recognition technology under the background of big data makes human–computer interaction more natural and flexible, bringing the richer interactive experience to teaching, on-board control, electronic games etc. To perform robust recognition under the conditions of illumination change, background clutter, rapid movement, and partial occlusion, an algorithm based on multi-level feature fusion of two-stream convolutional neural network is proposed, which includes three main steps. Firstly, the Kinect sensor obtains red–green–blue-depth (RGB-D) images to establish a gesture database. At the same time, data enhancement is performed on the training set and test set. Then, a model of multi-level feature fusion of a two-stream convolutional neural network is established and trained. Experiments show that the proposed network model can robustly track and recognise gestures under complex backgrounds (such as similar complexion, illumination changes, and occlusion), and compared with the single-channel model, the average detection accuracy is improved by 1.08%, and mean average precision is improved by 3.56%.

References Powered by Scopus

SSD: Single shot multibox detector

24773Citations
N/AReaders
Get full text

6D hands: Markerless hand tracking for computer aided design

204Citations
N/AReaders
Get full text

Review of constraints on vision-based gesture recognition for human-computer interaction

201Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Semantic segmentation for multiscale target based on object recognition using the improved Faster-RCNN model

172Citations
N/AReaders
Get full text

Detection algorithm of safety helmet wearing based on deep learning

147Citations
N/AReaders
Get full text

Manipulator grabbing position detection with information fusion of color image and depth image using deep learning

131Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Sun, Y., Weng, Y., Luo, B., Li, G., Tao, B., Du, J., & Chen, D. (2020). Gesture recognition algorithm based on multiscale feature fusion in RGB-D images. IET Image Processing, 14(15), 3662–3668. https://doi.org/10.1049/iet-ipr.2020.0148

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 4

50%

Professor / Associate Prof. 2

25%

Lecturer / Post doc 2

25%

Readers' Discipline

Tooltip

Engineering 5

50%

Computer Science 3

30%

Business, Management and Accounting 1

10%

Arts and Humanities 1

10%

Save time finding and organizing research with Mendeley

Sign up for free