Robustly detecting people in real-world scenes is a fundamental and challenging task in computer vision. State-of-the-art approaches use powerful learning methods and manually annotated image data. Importantly, these learning-based approaches rely on the assumption that the collected training data is representative of all relevant variations necessary to detect people. Rather than collecting and annotating ever more training data, this paper explores the possibility of using a 3D human shape and pose model from computer graphics to add relevant shape information and so learn more powerful people detection models. By sampling from the space of 3D shapes, we are able to control data variability while covering the major shape variations of humans, which are often difficult to capture when collecting real-world training images. We evaluate our data generation method for a people detection model based on pictorial structures. As we show on a challenging multi-viewpoint dataset, the additional information contained in the 3D shape model helps to outperform models trained on image data alone (see e.g. Fig. 1). © 2011. The copyright of this document resides with its authors.
CITATION
Pishchulin, L., Jain, A., Wojek, C., Thormählen, T., & Schiele, B. (2011). In good shape: Robust people detection based on appearance and shape. In BMVC 2011 - Proceedings of the British Machine Vision Conference 2011. British Machine Vision Association, BMVA. https://doi.org/10.5244/C.25.5