Multi-person 3D pose estimation from unlabelled data

Daniel Rodriguez-Criado; Pilar Bachiller-Burgos; George Vogiatzis; Luis J. Manso

Journal Article (Open Access)

Machine Vision and Applications (2024) 35(3)

DOI: 10.1007/s00138-024-01530-6

2 Citations

12 Readers

Abstract

Its numerous applications make multi-human 3D pose estimation a remarkably impactful area of research. Nevertheless, it presents several challenges, especially when approached using multiple views and regular RGB cameras as the only input. First, each person must be uniquely identified across the different views. Second, the estimation must be robust to noise, partial occlusions, and views in which a person may not be detected. Third, many pose estimation approaches rely on environment-specific annotated datasets that are frequently prohibitively expensive and/or require specialised hardware. In this work, we address these three challenges with the help of self-supervised learning; specifically, this is the first multi-camera, multi-person data-driven approach that does not require an annotated dataset. We present a three-stage pipeline and a rigorous evaluation providing evidence that our approach runs faster than other state-of-the-art algorithms, with comparable accuracy, and, most importantly, does not require annotated datasets. The pipeline is composed of a 2D skeleton detection step, followed by a Graph Neural Network that estimates cross-view correspondences of the people in the scene, and a Multi-Layer Perceptron that transforms the 2D information into 3D pose estimations. Our proposal comprises the last two steps and is compatible with any 2D skeleton detector as input. These two models are trained in a self-supervised manner, thus avoiding the need for datasets annotated with 3D ground-truth poses.
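The data flow of the three stages described in the abstract can be sketched roughly as follows. This is only an illustrative skeleton, not the authors' implementation: the function names, the `same_person_score` callable (standing in for the trained Graph Neural Network), the `lifter` callable (standing in for the trained Multi-Layer Perceptron), the zero-filling of missing views, and the toy scorer in the usage lines are all assumptions made here for illustration. Grouping by thresholded pairwise scores is a simplification that could, in principle, merge two detections from the same view through a chain of matches.

```python
from itertools import combinations

# A detection is a pair (view_id, keypoints), where keypoints is a list of
# (x, y) image coordinates, one per joint, as produced by any 2D detector.


def group_by_person(detections, same_person_score, threshold=0.5):
    """Stage 2 stand-in: link detections from different views whose pairwise
    score exceeds `threshold`, then return the connected groups.
    `same_person_score` is a hypothetical placeholder for the learned matcher."""
    parent = list(range(len(detections)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in combinations(range(len(detections)), 2):
        if detections[i][0] == detections[j][0]:
            continue  # never link two detections from the same view directly
        if same_person_score(detections[i], detections[j]) > threshold:
            parent[find(i)] = find(j)

    groups = {}
    for idx, det in enumerate(detections):
        groups.setdefault(find(idx), []).append(det)
    return list(groups.values())


def build_lifter_input(group, num_views, num_joints):
    """Stage 3 input: flatten one person's 2D keypoints into a fixed-length
    vector of (x, y, visible) triples. Views without a detection are
    zero-filled with visible = 0 (an assumed encoding)."""
    by_view = {view: keypoints for view, keypoints in group}
    features = []
    for view in range(num_views):
        keypoints = by_view.get(view)
        for joint in range(num_joints):
            if keypoints is None:
                features.extend([0.0, 0.0, 0.0])
            else:
                x, y = keypoints[joint]
                features.extend([float(x), float(y), 1.0])
    return features


def estimate_poses(detections, same_person_score, lifter, num_views, num_joints):
    """Run stages 2 and 3 on the output of stage 1 (the 2D detections)."""
    groups = group_by_person(detections, same_person_score)
    return [lifter(build_lifter_input(g, num_views, num_joints)) for g in groups]


# Toy usage with two views and two joints per skeleton.
def toy_score(a, b):
    # Toy scorer only: compares mean x coordinates; it has no geometric meaning.
    mean_x = lambda det: sum(x for x, _ in det[1]) / len(det[1])
    return 1.0 if abs(mean_x(a) - mean_x(b)) < 10 else 0.0


detections = [
    (0, [(10, 10), (12, 20)]),    # person A seen in view 0
    (1, [(11, 9), (13, 21)]),     # person A seen in view 1
    (0, [(200, 50), (202, 60)]),  # person B seen only in view 0
]
groups = group_by_person(detections, toy_score)
dummy_lifter = lambda features: [[0.0, 0.0, 0.0] for _ in range(2)]  # placeholder MLP
poses = estimate_poses(detections, toy_score, dummy_lifter, num_views=2, num_joints=2)
```

Keeping the matcher and the lifter as injected callables mirrors the abstract's statement that the last two stages are independent of the choice of 2D skeleton detector; how those two models are actually trained without 3D annotations is not covered by this sketch.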

Cite (APA)

Rodriguez-Criado, D., Bachiller-Burgos, P., Vogiatzis, G., & Manso, L. J. (2024). Multi-person 3D pose estimation from unlabelled data. Machine Vision and Applications, 35(3). https://doi.org/10.1007/s00138-024-01530-6

Readers' Seniority

PhD / Post grad / Masters / Doc: 4 (80%)
Researcher: 1 (20%)

Readers' Discipline

Computer Science: 3 (50%)
Engineering: 2 (33%)
Neuroscience: 1 (17%)
