The performance of autonomous agents in both commercial and consumer applications increases along with their situational awareness. Tasks such as obstacle avoidance, agent to agent interaction, and path planning are directly dependent upon their ability to convert sensor readings into scene understanding. Central to this is the ability to detect and recognize objects. Many object detection methodologies operate on a single modality such as vision or LiDAR. Camera-based object detection models benefit from an abundance of feature-rich information for classifying different types of objects. LiDAR-based object detection models use sparse point clouds, where each point contains accurate 3D position of object surfaces. Camera-based methods lack accurate object to lens distance measurements, while LiDAR-based methods lack dense feature-rich details. By utilizing information from both camera and LiDAR sensors, advanced object detection and identification is possible. In this work, we introduce a deep learning framework for fusing these modalities and produce a robust real-time 3D bounding box object detection network. We demonstrate qualitative and quantitative analysis of the proposed fusion model on the popular KITTI dataset.
CITATION STYLE
Bhanushali, D., Relyea, R., Manghi, K., Vashist, A., Hochgraf, C., Ganguly, A., … Ptucha, R. (2020). LiDAR-camera fusion for 3D object detection. In IS and T International Symposium on Electronic Imaging Science and Technology (Vol. 2020). Society for Imaging Science and Technology. https://doi.org/10.2352/ISSN.2470-1173.2020.16.AVM-257
Mendeley helps you to discover research relevant for your work.