An Unsupervised Monocular Visual Odometry Based on Multi-Scale Modeling

1Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.

Abstract

Unsupervised deep learning methods have shown great success in jointly estimating camera pose and depth from monocular videos. However, previous methods mostly ignore the importance of multi-scale information, which is crucial for pose estimation and depth estimation, especially when the motion pattern is changed. This article proposes an unsupervised framework for monocular visual odometry (VO) that can model multi-scale information. The proposed method utilizes densely linked atrous convolutions to increase the receptive field size without losing image information, and adopts a non-local self-attention mechanism to effectively model the long-range dependency. Both of them can model objects of different scales in the image, thereby improving the accuracy of VO, especially in rotating scenes. Extensive experiments on the KITTI dataset have shown that our approach is competitive with other state-of-the-art unsupervised learning-based monocular methods and is comparable to supervised or model-based methods. In particular, we have achieved state-of-the-art results on rotation estimation.

References Powered by Scopus

Deep residual learning for image recognition

174383Citations
N/AReaders
Get full text

U-net: Convolutional networks for biomedical image segmentation

65065Citations
N/AReaders
Get full text

ImageNet: A Large-Scale Hierarchical Image Database

51038Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Advanced Monocular Outdoor Pose Estimation in Autonomous Systems: Leveraging Optical Flow, Depth Estimation, and Semantic Segmentation with Dynamic Object Removal

0Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Zhi, H., Yin, C., Li, H., & Pang, S. (2022). An Unsupervised Monocular Visual Odometry Based on Multi-Scale Modeling. Sensors, 22(14). https://doi.org/10.3390/s22145193

Readers' Seniority

Tooltip

Lecturer / Post doc 2

50%

PhD / Post grad / Masters / Doc 2

50%

Readers' Discipline

Tooltip

Computer Science 2

50%

Engineering 2

50%

Save time finding and organizing research with Mendeley

Sign up for free