On reducing the number of visual words in the Bag-of-Features representation

Citations of this article
Mendeley users who have this article in their library.
Get full text


A new class of applications based on visual search engines are emerging, especially on smart-phones that have evolved into powerful tools for processing images and videos. The state-of-the-art algorithms for large visual content recognition and content based similarity search today use the "Bag of Features" (BoF) or "Bag of Words" (BoW) approach. The idea, borrowed from text retrieval, enables the use of inverted files. A very well known issue with this approach is that the query images, as well as the stored data, are described with thousands of words. This poses obvious efficiency problems when using inverted files to perform efficient image matching. In this paper, we propose and compare various techniques to reduce the number of words describing an image to improve efficiency and we study the effects of this reduction on effectiveness in landmark recognition and retrieval scenarios. We show that very relevant improvement in performance are achievable still preserving the advantages of the BoF base approach.




Amato, G., Falchi, F., & Gennaro, C. (2013). On reducing the number of visual words in the Bag-of-Features representation. In VISAPP 2013 - Proceedings of the International Conference on Computer Vision Theory and Applications (Vol. 1, pp. 657–662). https://doi.org/10.5220/0004290506570662

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free