Multiresolution models for object detection

Dennis Park; Deva Ramanan; Charless Fowlkes

Conference ProceedingsOPEN ACCESS

Multiresolution models for object detection

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2010) 6314 LNCS(PART 4) 241-254

DOI: 10.1007/978-3-642-15561-1_18

128Citations

226Readers

Abstract

Most current approaches to recognition aim to be scale-invariant. However, the cues available for recognizing a 300 pixel tall object are qualitatively different from those for recognizing a 3 pixel tall object. We argue that for sensors with finite resolution, one should instead use scale-variant, or multiresolution representations that adapt in complexity to the size of a putative detection window. We describe a multiresolution model that acts as a deformable part-based model when scoring large instances and a rigid template with scoring small instances. We also examine the interplay of resolution and context, and demonstrate that context is most helpful for detecting low-resolution instances when local models are limited in discriminative power. We demonstrate impressive results on the Caltech Pedestrian benchmark, which contains object instances at a wide range of scales. Whereas recent state-of-the-art methods demonstrate missed detection rates of 86%-37% at 1 false-positive-per-image, our multiresolution model reduces the rate to 29%. © 2010 Springer-Verlag.

Cite

CITATION STYLE

APA

Park, D., Ramanan, D., & Fowlkes, C. (2010). Multiresolution models for object detection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6314 LNCS, pp. 241–254). Springer Verlag. https://doi.org/10.1007/978-3-642-15561-1_18

Multiresolution models for object detection

Abstract

Cite

Register to see more suggestions