Explicit relevance feedback requires the user to manually refine the search queries in content-based image retrieval (CBIR). This may become laborious or even impossible due to the ever-increasing volume of digital databases. We present a multimodal information collector that can unobtrusively record and asynchronously transmit the user's implicit relevance feedback on a displayed image to the remote CBIR server, assisting in the retrieval of relevant images. The modalities of user interaction include eye movements, pointer tracks and clicks, keyboard strokes, and audio including speech. The client-side information collector has been implemented as a browser extension in JavaScript and integrated with an existing CBIR server. We verify its functionality by evaluating the performance of the gaze-enhanced CBIR system in on-line image tagging tasks. © 2011 Springer-Verlag.
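The core idea — buffering interaction events client-side and sending them asynchronously so the user interface is never blocked — can be sketched as below. This is a minimal illustration only; the class name, method names, and batching policy are assumptions, not the paper's actual implementation, which is not reproduced here.

```javascript
// Hypothetical sketch of an implicit-feedback collector's buffering logic.
// Interaction events (pointer, keyboard, gaze, audio markers) are buffered
// and flushed asynchronously to a remote CBIR server.
class FeedbackBuffer {
  constructor(flushSize, send) {
    this.flushSize = flushSize; // flush once this many events accumulate
    this.send = send;           // async transport, e.g. a fetch() POST in a browser
    this.events = [];
  }

  record(modality, data) {
    // Timestamp each event so the server can reconstruct the interaction timeline.
    this.events.push({ modality, data, t: Date.now() });
    if (this.events.length >= this.flushSize) this.flush();
  }

  flush() {
    if (this.events.length === 0) return;
    const batch = this.events.splice(0); // take and clear the buffer atomically
    this.send(batch);                    // fire-and-forget: never blocks the UI
  }
}

// In a browser extension, DOM listeners would feed the buffer, e.g.:
//   document.addEventListener('click', e =>
//     buffer.record('pointer', { x: e.clientX, y: e.clientY }));
```

The batch-and-flush design matters for unobtrusiveness: individual events are cheap to record synchronously, while network transmission happens in the background at buffer boundaries.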
Zhang, H., Sjöberg, M., Laaksonen, J., & Oja, E. (2011). A multimodal information collector for content-based image retrieval system. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7064 LNCS, pp. 737–746). https://doi.org/10.1007/978-3-642-24965-5_83