Combining densely sampled form and motion for human action recognition

Konrad Schindler; Luc Van Gool

Conference Proceedings

Combining densely sampled form and motion for human action recognition

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2008) 5096 LNCS 122-131

DOI: 10.1007/978-3-540-69321-5_13

4Citations

19Readers

Get full text

Abstract

We present a method for human action recognition from video, which exploits both form (local shape) and motion (local flow). Inspired by models of the human visual system, the two feature sets are processed independently in separate channels. The form channel extracts a dense local shape representation from every frame, while the motion channel extracts dense optic flow from the frame and its immediate predecessor. The same processing pipeline is applied in both channels: feature maps are pooled locally, down-sampled, and compared to a collection of learnt templates, yielding a vector of similarity scores. In a final step, the two score vectors are merged, and recognition is performed with a discriminative classifier. In an evaluation on two standard datasets our method outperforms the state-of-the-art, confirming that the combination of form and motion improves recognition. © 2008 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Schindler, K., & Van Gool, L. (2008). Combining densely sampled form and motion for human action recognition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5096 LNCS, pp. 122–131). https://doi.org/10.1007/978-3-540-69321-5_13

Combining densely sampled form and motion for human action recognition

Abstract

Cite

Register to see more suggestions