Relevance assessment for visual video re-ranking

Javier Aldana-Iuit; Ondřej Chum; Jiři Matas

Conference Proceedings

Relevance assessment for visual video re-ranking

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2014) 8814 421-430

DOI: 10.1007/978-3-319-11758-4_46

0Citations

1Readers

Get full text

Abstract

The following problem is considered: Given a name or phrase specifying an object, collect images and videos from the internet possibly depicting the object using a textual query on their name or annotation. A visual model from the images is built and used to rank the videos by relevance to the object of interest. Shot relevance is defined as the duration of the visibility of the object of interest. The model is based on local image features. The relevant shot detection builds on wide baseline stereo matching. The method is tested on 10 text phrases corresponding to 10 landmarks. The pool of 100 videos collected querying You- Tube with includes seven relevant videos for each landmark. The implementation runs faster than real-time at 208 frames per second. Averaged over the set of landmarks, at recall 0.95 the method has mean precision of 0.65, and the mean Average Precision (mAP) of 0.92.

Author supplied keywords

Cite

CITATION STYLE

APA

Aldana-Iuit, J., Chum, O., & Matas, J. (2014). Relevance assessment for visual video re-ranking. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8814, pp. 421–430). Springer Verlag. https://doi.org/10.1007/978-3-319-11758-4_46

Relevance assessment for visual video re-ranking

Abstract

Author supplied keywords

Cite

Register to see more suggestions