Towards semantic slam: 3d position and velocity estimation by fusing image semantic information with camera motion parameters for traffic scene analysis

Mostafa Mansour; Pavel Davidson; Oleg Stepanov; Robert Piché

Journal ArticleOPEN ACCESS

Towards semantic slam: 3d position and velocity estimation by fusing image semantic information with camera motion parameters for traffic scene analysis

Remote Sensing (2021) 13(3) 1-17

DOI: 10.3390/rs13030388

5Citations

9Readers

Abstract

In this paper, an EKF (Extended Kalman Filter)-based algorithm is proposed to estimate 3D position and velocity components of different cars in a scene by fusing the semantic information and car model, extracted from successive frames with camera motion parameters. First, a 2D virtual image of the scene is made using a prior knowledge of the 3D Computer Aided Design (CAD) models of the detected cars and their predicted positions. Then, a discrepancy, i.e., distance, between the actual image and the virtual image is calculated. The 3D position and the velocity components are recursively estimated by minimizing the discrepancy using EKF. The experiments on the KiTTi dataset show a good performance of the proposed algorithm with a position estimation error up to 3–5% at 30 m and velocity estimation error up to 1 m/s.

Author supplied keywords

Cite

CITATION STYLE

APA

Mansour, M., Davidson, P., Stepanov, O., & Piché, R. (2021). Towards semantic slam: 3d position and velocity estimation by fusing image semantic information with camera motion parameters for traffic scene analysis. Remote Sensing, 13(3), 1–17. https://doi.org/10.3390/rs13030388

Towards semantic slam: 3d position and velocity estimation by fusing image semantic information with camera motion parameters for traffic scene analysis

Abstract

Author supplied keywords

Cite

Register to see more suggestions