Mono3D++: Monocular 3D vehicle detection with two-scale 3D hypotheses and task priors

93Citations
Citations of this article
129Readers
Mendeley users who have this article in their library.

Abstract

We present a method to infer 3D pose and shape of vehicles from a single image. To tackle this ill-posed problem, we optimize two-scale projection consistency between the generated 3D hypotheses and their 2D pseudo-measurements. Specifically, we use a morphable wireframe model to generate a fine-scaled representation of vehicle shape and pose. To reduce its sensitivity to 2D landmarks, we jointly model the 3D bounding box as a coarse representation which improves robustness. We also integrate three task priors, including unsupervised monocular depth, a ground plane constraint as well as vehicle shape priors, with forward projection errors into an overall energy function.

Cite

CITATION STYLE

APA

He, T., & Soatto, S. (2019). Mono3D++: Monocular 3D vehicle detection with two-scale 3D hypotheses and task priors. In 33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Innovative Applications of Artificial Intelligence Conference, IAAI 2019 and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019 (pp. 8409–8416). AAAI Press. https://doi.org/10.1609/aaai.v33i01.33018409

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free