What, where and how many? Combining object detectors and CRFs

L'Ubor Ladický; Paul Sturgess; Karteek Alahari; Chris Russell; Philip H.S. Torr

Conference ProceedingsOPEN ACCESS

What, where and how many? Combining object detectors and CRFs

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2010) 6314 LNCS(PART 4) 424-437

DOI: 10.1007/978-3-642-15561-1_31

155Citations

216Readers

Abstract

Computer vision algorithms for individual tasks such as object recognition, detection and segmentation have shown impressive results in the recent past. The next challenge is to integrate all these algorithms and address the problem of scene understanding. This paper is a step towards this goal. We present a probabilistic framework for reasoning about regions, objects, and their attributes such as object class, location, and spatial extent. Our model is a Conditional Random Field defined on pixels, segments and objects. We define a global energy function for the model, which combines results from sliding window detectors, and low-level pixel-based unary and pairwise relations. One of our primary contributions is to show that this energy function can be solved efficiently. Experimental results show that our model achieves significant improvement over the baseline methods on CamVid and pascal voc datasets. © 2010 Springer-Verlag.

Cite

CITATION STYLE

APA

Ladický, L., Sturgess, P., Alahari, K., Russell, C., & Torr, P. H. S. (2010). What, where and how many? Combining object detectors and CRFs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6314 LNCS, pp. 424–437). Springer Verlag. https://doi.org/10.1007/978-3-642-15561-1_31

What, where and how many? Combining object detectors and CRFs

Abstract

Cite

Register to see more suggestions