SC-wLS: Towards Interpretable Feed-forward Camera Re-localization

6Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Visual re-localization aims to recover camera poses in a known environment, which is vital for applications like robotics or augmented reality. Feed-forward absolute camera pose regression methods directly output poses by a network, but suffer from low accuracy. Meanwhile, scene coordinate based methods are accurate, but need iterative RANSAC post-processing, which brings challenges to efficient end-to-end training and inference. In order to have the best of both worlds, we propose a feed-forward method termed SC-wLS that exploits all scene coordinate estimates for weighted least squares pose regression. This differentiable formulation exploits a weight network imposed on 2D-3D correspondences, and requires pose supervision only. Qualitative results demonstrate the interpretability of learned weights. Evaluations on 7Scenes and Cambridge datasets show significantly promoted performance when compared with former feed-forward counterparts. Moreover, our SC-wLS method enables a new capability: self-supervised test-time adaptation on the weight network. Codes and models are publicly available.

Cite

CITATION STYLE

APA

Wu, X., Zhao, H., Li, S., Cao, Y., & Zha, H. (2022). SC-wLS: Towards Interpretable Feed-forward Camera Re-localization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13661 LNCS, pp. 585–601). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-19769-7_34

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free