BEV-SLAM: Building a Globally-Consistent World Map Using Monocular Vision

14Citations
Citations of this article
21Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The ability to produce large-scale maps for nav-igation, path planning and other tasks is a crucial step for autonomous agents, but has always been challenging. In this work, we introduce BEV-SLAM, a novel type of graph-based SLAM that aligns semantically-segmented Bird's Eye View (BEV) predictions from monocular cameras. We introduce a novel form of occlusion reasoning into BEV estimation and demonstrate its importance to aid spatial aggregation of BEV predictions. The result is a versatile SLAM system that can operate across arbitrary multi-camera configurations and can be seamlessly integrated with other sensors. We show that the use of multiple cameras significantly increases performance, and achieves lower relative error than high-performance GPS. The resulting system is able to create large, dense, globally-consistent world maps from monocular cameras mounted around an ego vehicle. The maps are metric and correctly-scaled, making them suitable for downstream navigation tasks.

Cite

CITATION STYLE

APA

Ross, J., Mendez, O., Saha, A., Johnson, M., & Bowden, R. (2022). BEV-SLAM: Building a Globally-Consistent World Map Using Monocular Vision. In IEEE International Conference on Intelligent Robots and Systems (Vol. 2022-October, pp. 3830–3836). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/IROS47612.2022.9981258

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free