Human activity recognition in video benchmarks: A survey

29Citations
Citations of this article
37Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Vision-based Human activity recognition is becoming a trendy area of research due to its broad application such as security and surveillance, human–computer interactions, patients monitoring system, and robotics. For the recognition of human activity various approaches have been developed and to test the performance on these video datasets. Hence, the objective of this survey paper is to outline the different video datasets and highlights their merits and demerits under practical considerations. We have categorized these datasets into two part. The first part consists two-dimensional (2D-RGB) datasets and the second part has three-dimensional (3D-RGB) datasets. The most prominent challenges involved in these datasets are occlusions, illumination variation, view variation, annotation, and fusion of modalities. The key specification of these datasets are resolutions, frame rate, actions/actors, background, and application domain. All specifications, challenges involved, and the comparison made in tabular form. We have also presented the state-of-the-art algorithms that give the highest accuracy on these datasets.

Cite

CITATION STYLE

APA

Singh, T., & Vishwakarma, D. K. (2019). Human activity recognition in video benchmarks: A survey. In Lecture Notes in Electrical Engineering (Vol. 526, pp. 247–259). Springer Verlag. https://doi.org/10.1007/978-981-13-2553-3_24

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free