Sign up & Download
Sign in

Monocular 3D pose estimation and tracking by detection

by Mykhaylo Andriluka, Stefan Roth, Bernt Schiele
2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition ()

Abstract

Automatic recovery of 3D human pose from monocular image sequences is a challenging and important research topic with numerous applications. Although current methods are able to recover 3D pose for a single person in controlled environments, they are severely challenged by real-world scenarios, such as crowded street scenes. To address this problem, we propose a three-stage process building on a number of recent advances. The first stage obtains an initial estimate of the 2D articulation and viewpoint of the person from single frames. The second stage allows early data association across frames based on tracking-by-detection. These two stages successfully accumulate the available 2D image evidence into robust estimates of 2D limb positions over short image sequences (= tracklets). The third and final stage uses those tracklet-based estimates as robust image observations to reliably recover 3D pose. We demonstrate state-of-the-art performance on the HumanEva II benchmark, and also show the applicability of our approach to articulated 3D tracking in realistic street conditions.

Cite this document (BETA)

Readership Statistics

43 Readers on Mendeley
by Discipline
 
 
 
by Academic Status
 
53% Ph.D. Student
 
19% Student (Master)
 
7% Doctoral Student
by Country
 
5% Germany
 
5% United Kingdom
 
2% Turkey

Sign up today - FREE

Mendeley saves you time finding and organizing research. Learn more

  • All your research in one place
  • Add and import papers easily
  • Access it anywhere, anytime

Start using Mendeley in seconds!

Already have an account? Sign in