We propose a discriminative patch-level model which combines appearance and spatial layout cues. We start from a block-sparse model of patch appearance based on the normalized Fisher vector representation. The appearance model is responsible for (i) selecting a discriminative subset of visual words, and (ii) identifying distinctive patches assigned to the selected subset. These patches are further filtered by a sparse spatial model operating on a novel representation of pairwise patch layout. We have evaluated the proposed pipeline in image classification and weakly supervised localization experiments on a public traffic sign dataset. The results show significant advantage of the combined model over state of the art appearance models.
CITATION STYLE
Zadrija, V., Krapac, J., Verbeek, J., & Šegvić, S. (2015). Patch-level spatial layout for classification and weakly supervised localization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9358, pp. 492–503). Springer Verlag. https://doi.org/10.1007/978-3-319-24947-6_41
Mendeley helps you to discover research relevant for your work.