Towards semantic slam: 3d position and velocity estimation by fusing image semantic information with camera motion parameters for traffic scene analysis

5Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.

Abstract

In this paper, an EKF (Extended Kalman Filter)-based algorithm is proposed to estimate 3D position and velocity components of different cars in a scene by fusing the semantic information and car model, extracted from successive frames with camera motion parameters. First, a 2D virtual image of the scene is made using a prior knowledge of the 3D Computer Aided Design (CAD) models of the detected cars and their predicted positions. Then, a discrepancy, i.e., distance, between the actual image and the virtual image is calculated. The 3D position and the velocity components are recursively estimated by minimizing the discrepancy using EKF. The experiments on the KiTTi dataset show a good performance of the proposed algorithm with a position estimation error up to 3–5% at 30 m and velocity estimation error up to 1 m/s.

Cite

CITATION STYLE

APA

Mansour, M., Davidson, P., Stepanov, O., & Piché, R. (2021). Towards semantic slam: 3d position and velocity estimation by fusing image semantic information with camera motion parameters for traffic scene analysis. Remote Sensing, 13(3), 1–17. https://doi.org/10.3390/rs13030388

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free