Visual word aggregation

R. J. López-Sastre; J. Renes-Olalla; P. Gil-Jiménez; S. Maldonado-Bascón

Conference Proceedings

Visual word aggregation

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011) 6669 LNCS 676-683

DOI: 10.1007/978-3-642-21257-4_84

4Citations

5Readers

Get full text

Abstract

Most recent category-level object recognition systems work with visual words, i.e. vector quantized local descriptors. These visual vocabularies are usually constructed by using a single method such as K-means for clustering the descriptor vectors of patches sampled either densely or sparsely from a set of training images. Instead, in this paper we propose a novel methodology for building efficient codebooks for visual recognition using clustering aggregation techniques: the Visual Word Aggregation (VWA). Our aim is threefold: to increase the stability of the visual vocabulary construction process; to increase the image classification rate; and also to automatically determine the size of the visual codebook. Results on image classification are presented on the testbed PASCAL VOC Challenge 2007. © 2011 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

López-Sastre, R. J., Renes-Olalla, J., Gil-Jiménez, P., & Maldonado-Bascón, S. (2011). Visual word aggregation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6669 LNCS, pp. 676–683). https://doi.org/10.1007/978-3-642-21257-4_84

Visual word aggregation

Abstract

Author supplied keywords

Cite

Register to see more suggestions