Unsupervised learning spatio-temporal features for human activity recognition from RGB-D video data

Abstract

Recognizing human activities is essential for several applications, including social robotics. Recently developed commodity depth sensors open up new possibilities for dealing with this problem. Existing techniques extract hand-tuned features, such as HOG3D or STIP, from video data; these features do not adapt easily to new modalities. In addition, because depth video data is of low quality due to noise, a question arises: does the depth video data provide extra information for activity recognition? To address this issue, we propose an unsupervised learning approach that applies generally to both RGB and depth video data. We further employ a multiple kernel learning (MKL) classifier to take into account combinations of the different modalities. We show that the low-quality depth video is discriminative for activity recognition. We also demonstrate that our approach achieves superior performance to state-of-the-art approaches on two challenging RGB-D activity recognition datasets. © Springer International Publishing 2013.
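
As a rough illustration of how kernels from different modalities can be combined, the sketch below builds one Gaussian kernel per modality and mixes them with fixed non-negative weights. This is a simplification and not the authors' method: an MKL classifier would learn the weights from training data, whereas here they are set by hand. The feature arrays, the `gamma` value, the weights, and the function names `rbf_kernel` and `combine_kernels` are all assumptions made for the example.

```python
import numpy as np

def rbf_kernel(X, Y, gamma):
    """Gaussian (RBF) kernel matrix between the rows of X and the rows of Y."""
    sq_dist = (X ** 2).sum(axis=1)[:, None] + (Y ** 2).sum(axis=1)[None, :] - 2.0 * X @ Y.T
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-gamma * np.maximum(sq_dist, 0.0))

def combine_kernels(kernels, weights):
    """Convex combination sum_m beta_m * K_m with hand-set weights (not learned)."""
    w = np.asarray(weights, dtype=float)
    if np.any(w < 0) or w.sum() == 0:
        raise ValueError("weights must be non-negative and not all zero")
    w = w / w.sum()  # normalise so the weights sum to 1
    return sum(beta * K for beta, K in zip(w, kernels))

# Hypothetical per-video feature vectors for each modality (random placeholders).
rng = np.random.default_rng(0)
rgb_feats = rng.normal(size=(6, 8))
depth_feats = rng.normal(size=(6, 5))

K_rgb = rbf_kernel(rgb_feats, rgb_feats, gamma=0.1)
K_depth = rbf_kernel(depth_feats, depth_feats, gamma=0.1)

# Assumed weights; an MKL solver would estimate these instead.
K = combine_kernels([K_rgb, K_depth], weights=[0.7, 0.3])
```

The combined matrix `K` is itself a valid kernel, so it could be passed to any classifier that accepts a precomputed kernel matrix.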

Cite

Chen, G., Zhang, F., Giuliani, M., Buckl, C., & Knoll, A. (2013). Unsupervised learning spatio-temporal features for human activity recognition from RGB-D video data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8239 LNAI, pp. 341–350). https://doi.org/10.1007/978-3-319-02675-6_34
