We present a novel algorithm for weakly supervised action classification in videos. We assume we are given training videos annotated only with action class labels. We learn a model that can classify unseen test videos, as well as localize a region of interest in the video that captures the discriminative essence of the action class. A novel Similarity Constrained Latent Support Vector Machine model is developed to operationalize this goal. This model specifies that videos should be classified correctly, and that the latent regions of interest chosen should be coherent over videos of an action class. The resulting learning problem is challenging, and we show how dual decomposition can be employed to render it tractable. Experimental results demonstrate the efficacy of the method. © 2012 Springer-Verlag.
CITATION STYLE
Shapovalova, N., Vahdat, A., Cannons, K., Lan, T., & Mori, G. (2012). Similarity constrained latent support vector machine: An application to weakly supervised action classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7578 LNCS, pp. 55–68). https://doi.org/10.1007/978-3-642-33786-4_5
Mendeley helps you to discover research relevant for your work.