Human action recognition based on temporal pose CNN and multi-dimensional fusion

Yi Huang; Shang Hong Lai; Shao Heng Tai

Conference ProceedingsOPEN ACCESS

Human action recognition based on temporal pose CNN and multi-dimensional fusion

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11130 LNCS 426-440

DOI: 10.1007/978-3-030-11012-3_33

8Citations

15Readers

Abstract

To take advantage of recent advances in human pose estimation from images, we develop a deep neural network model for action recognition from videos by computing temporal human pose features with a 3D CNN model. The proposed temporal pose features can provide more discriminative human action information than previous video features, such as appearance and short-term motion. In addition, we propose a novel fusion network that combines temporal pose, spatial and motion feature maps for the classification by bridging the gap between the dimension difference between 3D and 2D CNN feature maps. We show that the proposed action recognition system provides superior accuracy compared to the previous methods through experiments on Sub-JHMDB and PennAction datasets.

Author supplied keywords

Cite

CITATION STYLE

APA

Huang, Y., Lai, S. H., & Tai, S. H. (2019). Human action recognition based on temporal pose CNN and multi-dimensional fusion. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11130 LNCS, pp. 426–440). Springer Verlag. https://doi.org/10.1007/978-3-030-11012-3_33

Human action recognition based on temporal pose CNN and multi-dimensional fusion

Abstract

Author supplied keywords

Cite

Register to see more suggestions