Efficient media retrieval from non-cooperative queries

Kevin Shih; Wei Di; Vignesh Jagadeesh; Robinson Piramuthu

Conference Proceedings

Efficient media retrieval from non-cooperative queries

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9163 391-403

DOI: 10.1007/978-3-319-20904-3_35

0Citations

6Readers

Get full text

Abstract

Text is ubiquitous in the artificial world and easily attainable when it comes to book title and author names. Using the images from the book cover set from the Stanford Mobile Visual Search dataset and additional book covers and metadata from openlibrary. org, we construct a large scale book cover retrieval dataset, complete with 100K distractor covers and title and author strings for each. Because our query images are poorly conditioned for clean text extraction, we propose a method for extracting a matching noisy and erroneous OCR readings and matching it against clean author and book title strings in a standard document look-up problem setup. Finally, we demonstrate how to use this text-matching as a feature in conjunction with popular retrieval features such as VLAD using a simple learning setup to achieve significant improvements in retrieval accuracy over that of either VLAD or the text alone.

Author supplied keywords

Cite

CITATION STYLE

APA

Shih, K., Di, W., Jagadeesh, V., & Piramuthu, R. (2015). Efficient media retrieval from non-cooperative queries. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9163, pp. 391–403). Springer Verlag. https://doi.org/10.1007/978-3-319-20904-3_35

Efficient media retrieval from non-cooperative queries

Abstract

Author supplied keywords

Cite

Register to see more suggestions