Abstract
Bag of visual words model has recently attracted much attention from computer vision society because of its notable success in analysing images and exploring their content. This study improves this model by utilising the adjacency information between words. To explore this information, a binary tree structure is constructed from the visual words in order to model the is - a relationships in the vocabulary. Informative nodes of this tree are extracted by using the Χ2 criterion and are used to capture the adjacency information of visual words. This approach is a simple and computationally effective way for modelling the spatial relations of visual words, which improves the image classification performance. The authors evaluated our method for visual classification of three known datasets: 15 natural scenes, Caltech-101 and Graz-01.© The Institution of Engineering and Technology 2014.
Cite
CITATION STYLE
Farhangi, M. M., Soryani, M., & Fathy, M. (2014). Informative visual words construction to improve bag of words image representation. IET Image Processing, 8(5), 310–318. https://doi.org/10.1049/iet-ipr.2013.0449
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.