This paper presents a method for annotating and retrieving videos of human actions based on two-way matrix factorization. The method models the task as finding a common latent-space representation for multimodal objects; in this case, the modalities are the visual and textual (annotation) information associated with each video, both of which are projected into the latent space. Given this shared space, one input space can be mapped to another, e.g. visual to textual, by projecting through the latent space. This cross-space mapping is optimized explicitly in the cost function and learned from training data containing both modalities. The resulting algorithm can be used for annotation, by projecting only the visual information and recovering a textual representation, or for retrieval, by indexing on the latent or textual space. Experimental evaluation shows competitive results against state-of-the-art annotation and retrieval methods.
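The idea of a shared latent space learned from two modalities can be illustrated with a minimal sketch. The code below is an assumption-laden toy, not the paper's exact formulation: it uses a simple squared-loss objective, synthetic data, and alternating least squares to jointly factorize a visual feature matrix X and a textual matrix Y through one latent representation H, then annotates a new video by projecting its visual features into the latent space and mapping them to the textual space.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy setup (illustrative dimensions): n videos, visual features of size dv,
# textual features of size dt, latent dimension k.
n, dv, dt, k = 50, 20, 10, 5
H_true = rng.normal(size=(n, k))
Wv_true = rng.normal(size=(k, dv))
Wt_true = rng.normal(size=(k, dt))
X = H_true @ Wv_true  # visual modality
Y = H_true @ Wt_true  # textual modality (annotations)

# Alternating least squares for  min ||X - H Wv||^2 + ||Y - H Wt||^2
H = rng.normal(size=(n, k))
for _ in range(50):
    # With H fixed, solve for the modality-specific projections.
    Wv, _, _, _ = np.linalg.lstsq(H, X, rcond=None)
    Wt, _, _, _ = np.linalg.lstsq(H, Y, rcond=None)
    # With Wv, Wt fixed, solve for H using both modalities stacked.
    W = np.hstack([Wv, Wt])   # (k, dv + dt)
    Z = np.hstack([X, Y])     # (n, dv + dt)
    H = np.linalg.lstsq(W.T, Z.T, rcond=None)[0].T

# Annotation: project a video's visual features into the latent space,
# then map the latent vector to the textual space.
x_new = X[0]                                          # stand-in for a query video
h_new = np.linalg.lstsq(Wv.T, x_new, rcond=None)[0]   # visual -> latent
y_pred = h_new @ Wt                                   # latent -> textual
print("annotation error:", np.linalg.norm(y_pred - H_true[0] @ Wt_true))
```

Because the toy data is exactly rank k, the factorization fits it exactly and the predicted textual vector matches the one generated from the true latent factors; with real features, the latent space only approximates both modalities and regularization would typically be added.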
CITATION STYLE
Páez, F., & González, F. A. (2015). Annotating and retrieving videos of human actions using matrix factorization. In Lecture Notes in Computer Science (Vol. 9423, pp. 743–750). Springer. https://doi.org/10.1007/978-3-319-25751-8_89