An Extensive Analysis of the Vision-based Deep Learning Techniques for Action Recognition

Abstract

Action recognition involves localizing and classifying actions in a video over a sequence of frames. It can be thought of as image classification extended along the temporal dimension: the information obtained from multiple frames is aggregated to produce the action classification output. Applications of action recognition systems range from healthcare assistance to human-machine interaction. Action recognition has proven challenging because it poses many impediments, including high computational cost, the difficulty of capturing extended context, complex architecture design, and a lack of benchmark datasets. Increasing the efficiency of human action recognition algorithms can significantly improve the feasibility of deploying them in real-world scenarios. This paper summarizes the evolution of action localization, classification, and detection algorithms applied to data from vision-based sensors. We also review the datasets that have been used for action classification, localization, and detection. We further explore action classification and temporal and spatiotemporal action detection methods that use convolutional neural networks, recurrent neural networks, or a combination of both.
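To illustrate the idea of aggregating per-frame information into a single clip-level label, here is a minimal sketch. It is not taken from the paper: it assumes some image classifier has already produced one vector of class logits per frame, and it uses plain score averaging, which is only one simple aggregation choice among many. The function names, labels, and logit values are hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def softmax(logits: Sequence[float]) -> List[float]:
    """Convert raw class scores into probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def aggregate_frame_scores(frame_logits: Sequence[Sequence[float]]) -> List[float]:
    """Average per-frame class probabilities over all frames of a clip."""
    if not frame_logits:
        raise ValueError("need at least one frame")
    per_frame = [softmax(frame) for frame in frame_logits]
    n_frames = len(per_frame)
    n_classes = len(per_frame[0])
    return [sum(p[c] for p in per_frame) / n_frames for c in range(n_classes)]


def classify_clip(
    frame_logits: Sequence[Sequence[float]], labels: Sequence[str]
) -> Tuple[str, float]:
    """Return the clip-level label and its averaged probability."""
    probs = aggregate_frame_scores(frame_logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return labels[best], probs[best]


# Hypothetical logits for three frames over three made-up action classes.
labels = ["walk", "run", "jump"]
frame_logits = [
    [2.0, 0.5, 0.1],  # frame 1 favours "walk"
    [1.5, 1.7, 0.0],  # frame 2 slightly favours "run"
    [2.2, 0.3, 0.4],  # frame 3 favours "walk"
]
label, confidence = classify_clip(frame_logits, labels)
print(label, round(confidence, 3))
```

Averaging ignores frame order, so it cannot separate actions that differ only in their temporal structure. That limitation is one motivation for the recurrent and spatiotemporal convolutional models the paper surveys.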

Cite

Manasa, R., Shukla, R., & Saranya, K. C. (2021). An Extensive Analysis of the Vision-based Deep Learning Techniques for Action Recognition. International Journal of Advanced Computer Science and Applications, 12(2), 604–611. https://doi.org/10.14569/IJACSA.2021.0120276
