Action recognition using hybrid feature descriptor and VLAD video encoding

Dong Xing; Xianzhong Wang; Hongtao Lu

Conference Proceedings

Action recognition using hybrid feature descriptor and VLAD video encoding

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9008 99-112

DOI: 10.1007/978-3-319-16628-5_8

4Citations

11Readers

Get full text

Abstract

Human action recognition in video has found widespread applications in many fields. However, this task is still facing many challenges due to the existence of intra-class diversity and inter-class overlaps among different action categories. The key trick of action recognition lies in the extraction of more comprehensive features to cover the action, as well as a compact and discriminative video encoding representation. Based on this observation, in this paper we propose a hybrid feature descriptor, which combines both static descriptor and motional descriptor to cover more action information inside video clips. We also adopt the usage of VLAD encoding method to encapsulate more structural information within the distribution of feature vectors. The recognition effects of our framework are evaluated on three benchmark datasets: KTH, Weizmann, and YouTube. The experimental results demonstrate that the hybrid descriptor, facilitated with VLAD encoding method, outperforms traditional descriptors by a large margin.

Cite

CITATION STYLE

APA

Xing, D., Wang, X., & Lu, H. (2015). Action recognition using hybrid feature descriptor and VLAD video encoding. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9008, pp. 99–112). Springer Verlag. https://doi.org/10.1007/978-3-319-16628-5_8

Action recognition using hybrid feature descriptor and VLAD video encoding

Abstract

Cite

Register to see more suggestions