When localizing and detecting 3D objects in autonomous driving scenes, fusing information from multiple sensors (e.g., camera, LiDAR) provides complementary cues that enhance the robustness of 3D detectors. This paper proposes RoIFusion, a deep neural network architecture that efficiently fuses multi-modal features for 3D object detection by leveraging the advantages of both LiDAR and camera sensors. Instead of densely combining point-wise features of the point cloud with the corresponding pixel features, our fusion method aggregates a small set of 3D Regions of Interest (RoIs) in the point cloud with their corresponding 2D RoIs in the image, which reduces the computational cost and avoids viewpoint misalignment during feature aggregation across sensors. Extensive experiments on the challenging KITTI 3D object detection benchmark show the effectiveness of our fusion method and demonstrate that our deep fusion approach achieves state-of-the-art performance.
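As a concrete illustration of the RoI pairing described above, the sketch below shows one plausible way to derive a 2D RoI from a 3D RoI on KITTI: project the eight corners of a 3D box (assumed to already be in the rectified camera frame) through a KITTI-style 3x4 projection matrix and take the tight image-plane bounding box. The helper names (project_to_image, roi_3d_to_2d) and the matrix values are illustrative assumptions, not the paper's actual implementation.

import numpy as np

def project_to_image(pts_3d, P):
    # Project Nx3 points in rectified camera coordinates onto the image
    # plane with a 3x4 projection matrix P (KITTI convention).
    pts_h = np.hstack([pts_3d, np.ones((pts_3d.shape[0], 1))])  # Nx4 homogeneous
    pts_2d = pts_h @ P.T                                        # Nx3
    return pts_2d[:, :2] / pts_2d[:, 2:3]                       # divide by depth

def roi_3d_to_2d(corners_3d, P):
    # Map a 3D RoI (8 box corners) to its corresponding 2D RoI:
    # the tight axis-aligned bounding box of the projected corners.
    corners_2d = project_to_image(corners_3d, P)
    x_min, y_min = corners_2d.min(axis=0)
    x_max, y_max = corners_2d.max(axis=0)
    return np.array([x_min, y_min, x_max, y_max])  # [x1, y1, x2, y2] in pixels

# Usage: a hypothetical 3D box ~15 m in front of the camera, and a
# KITTI-style P2 matrix with illustrative (not calibrated) values.
P2 = np.array([[721.5,   0.0, 609.6, 44.9],
               [  0.0, 721.5, 172.9,  0.2],
               [  0.0,   0.0,   1.0,  0.003]])
corners = np.array([[dx, dy, dz]
                    for dx in (-0.8, 0.8)
                    for dy in (-0.8, 0.8)
                    for dz in (-2.0, 2.0)]) + np.array([0.0, 1.0, 15.0])
print(roi_3d_to_2d(corners, P2))

Pooling image features inside this 2D box and point features inside the 3D box yields one aligned feature pair per RoI, which is consistent with the abstract's claim that fusing a small set of RoIs is cheaper than dense point-to-pixel fusion.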