Abstract
The ability to produce large-scale maps for nav-igation, path planning and other tasks is a crucial step for autonomous agents, but has always been challenging. In this work, we introduce BEV-SLAM, a novel type of graph-based SLAM that aligns semantically-segmented Bird's Eye View (BEV) predictions from monocular cameras. We introduce a novel form of occlusion reasoning into BEV estimation and demonstrate its importance to aid spatial aggregation of BEV predictions. The result is a versatile SLAM system that can operate across arbitrary multi-camera configurations and can be seamlessly integrated with other sensors. We show that the use of multiple cameras significantly increases performance, and achieves lower relative error than high-performance GPS. The resulting system is able to create large, dense, globally-consistent world maps from monocular cameras mounted around an ego vehicle. The maps are metric and correctly-scaled, making them suitable for downstream navigation tasks.
Cite
CITATION STYLE
Ross, J., Mendez, O., Saha, A., Johnson, M., & Bowden, R. (2022). BEV-SLAM: Building a Globally-Consistent World Map Using Monocular Vision. In IEEE International Conference on Intelligent Robots and Systems (Vol. 2022-October, pp. 3830–3836). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/IROS47612.2022.9981258
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.