Recognizing realistic action using contextual feature group

2Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Although the spatial-temporal local features and the bag of visual words model (BoW) have achieved a great success and a wide adoption in action classification, there still remain some problems. First, the local features extracted are not stable enough, which may be aroused by the background action or camera shake. Second, using local features alone ignores the spatial-temporal relationships of these features, which may decrease the classification accuracy. Finally, the distance mainly used in the clustering algorithm of the BoW model did not take the semantic context into consideration. Based on these problems, we proposed a systematic framework for recognizing realistic actions, with considering the spatial-temporal relationship between the pruned local features and utilizing a new discriminate group distance to incorporate the semantic context information. The Support Vector Machine (SVM) with multiple kernels is employed to make use of both the local feature and feature group information. The proposed method is evaluated on KTH dataset and a relatively realistic dataset YouTube. Experimental results validate our approach and the recognition performance is promising.

Cite

CITATION STYLE

APA

Ye, Y., Qin, L., Cheng, Z., & Huang, Q. (2013). Recognizing realistic action using contextual feature group. In The Era of Interactive Media (Vol. 9781461435013, pp. 459–469). Springer New York. https://doi.org/10.1007/978-1-4614-3501-3_38

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free