Real-Time Multi-person Motion Capture from Multi-view Video and IMUs

36Citations
Citations of this article
63Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

A real-time motion capture system is presented which uses input from multiple standard video cameras and inertial measurement units (IMUs). The system is able to track multiple people simultaneously and requires no optical markers, specialized infra-red cameras or foreground/background segmentation, making it applicable to general indoor and outdoor scenarios with dynamic backgrounds and lighting. To overcome limitations of prior video or IMU-only approaches, we propose to use flexible combinations of multiple-view, calibrated video and IMU input along with a pose prior in an online optimization-based framework, which allows the full 6-DoF motion to be recovered including axial rotation of limbs and drift-free global position. A method for sorting and assigning raw input 2D keypoint detections into corresponding subjects is presented which facilitates multi-person tracking and rejection of any bystanders in the scene. The approach is evaluated on data from several indoor and outdoor capture environments with one or more subjects and the trade-off between input sparsity and tracking performance is discussed. State-of-the-art pose estimation performance is obtained on the Total Capture (mutli-view video and IMU) and Human 3.6M (multi-view video) datasets. Finally, a live demonstrator for the approach is presented showing real-time capture, solving and character animation using a light-weight, commodity hardware setup.

References Powered by Scopus

Long Short-Term Memory

78319Citations
N/AReaders
Get full text

Realtime multi-person 2D pose estimation using part affinity fields

4756Citations
N/AReaders
Get full text

SMPL: A skinned multi-person linear model

2992Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Physical Inertial Poser (PIP): Physics-aware Real-time Human Motion Tracking from Sparse Inertial Sensors

135Citations
N/AReaders
Get full text

SelfPose: 3D Egocentric Pose Estimation From a Headset Mounted Camera

32Citations
N/AReaders
Get full text

LiDAR-aid Inertial Poser: Large-scale Human Motion Capture by Sparse Inertial and LiDAR Sensors

30Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Malleson, C., Collomosse, J., & Hilton, A. (2020). Real-Time Multi-person Motion Capture from Multi-view Video and IMUs. International Journal of Computer Vision, 128(6), 1594–1611. https://doi.org/10.1007/s11263-019-01270-5

Readers over time

‘19‘20‘21‘22‘23‘24‘2507142128

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 33

89%

Researcher 4

11%

Readers' Discipline

Tooltip

Computer Science 18

56%

Engineering 10

31%

Neuroscience 2

6%

Mathematics 2

6%

Save time finding and organizing research with Mendeley

Sign up for free
0