RPNet: An end-to-end network for relative camera pose estimation

Sovann En; Alexis Lechervy; Frédéric Jurie

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11129 LNCS, pp. 738-745 (2019)

DOI: 10.1007/978-3-030-11009-3_46

Abstract

This paper addresses the task of relative camera pose estimation from raw image pixels by means of deep neural networks. The proposed network, RPNet, takes pairs of images as input and directly infers the relative pose, without requiring the cameras' intrinsic or extrinsic parameters. While state-of-the-art systems based on SIFT + RANSAC can recover the translation vector only up to scale, RPNet is trained end-to-end to produce the full translation vector. Experiments on the Cambridge Landmark data set show very promising results for the recovery of the full translation vector. They also show that RPNet produces more accurate and more stable results than traditional approaches, especially for hard images (repetitive textures, textureless images, etc.). To the best of our knowledge, RPNet is the first attempt to recover full translation vectors in relative pose estimation.
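
To make the distinction between a translation recovered "up to scale" and the "full translation vector" concrete, the following is a minimal sketch in plain Python. It is not the RPNet method and is not taken from the paper. It assumes a world-to-camera convention (x_cam = R x_world + t), and the two poses, the helper names, and the scale factor of 3 are made up for illustration. It computes the relative pose between two cameras, then shows that rescaling the scene changes the length of the relative translation but not its direction, which is why a method that only sees direction cannot tell the two scenes apart.

```python
import math

def rot_y(angle_rad):
    """Rotation matrix about the y axis, as 3x3 nested lists."""
    c, s = math.cos(angle_rad), math.sin(angle_rad)
    return [[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]]

def transpose(m):
    return [list(row) for row in zip(*m)]

def matmul(a, b):
    bt = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in bt] for row in a]

def matvec(m, v):
    return [sum(x * y for x, y in zip(row, v)) for row in m]

def norm(v):
    return math.sqrt(sum(x * x for x in v))

def unit(v):
    n = norm(v)
    return [x / n for x in v]

def relative_pose(r1, t1, r2, t2):
    """Pose of camera 2 relative to camera 1.

    Assumed convention (not from the paper): x_cam = R x_world + t.
    Then x_cam2 = R_rel x_cam1 + t_rel with the values returned here.
    """
    r_rel = matmul(r2, transpose(r1))
    t_rel = [a - b for a, b in zip(t2, matvec(r_rel, t1))]
    return r_rel, t_rel

# Two hypothetical absolute poses (illustrative values only).
r1, t1 = rot_y(math.radians(5.0)), [0.2, 0.0, -0.1]
r2, t2 = rot_y(math.radians(15.0)), [0.5, 0.0, 0.1]

# Full relative translation: direction and magnitude.
r_rel, t_full = relative_pose(r1, t1, r2, t2)

# The same scene with all distances multiplied by 3: rotations are unchanged,
# translations are three times longer.
scale = 3.0
_, t_full_scaled = relative_pose(
    r1, [scale * x for x in t1], r2, [scale * x for x in t2]
)

# Up-to-scale translation: only the direction (unit vector) is kept.
t_dir = unit(t_full)
t_dir_scaled = unit(t_full_scaled)

print("full translation:          ", t_full)
print("full translation (scaled): ", t_full_scaled)
print("direction only:            ", t_dir)
print("direction only (scaled):   ", t_dir_scaled)
```

In this sketch the two direction vectors coincide while the full vectors differ by the scale factor, so predicting the full vector means predicting the magnitude as well as the direction.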

Citation (APA)

En, S., Lechervy, A., & Jurie, F. (2019). RPNet: An end-to-end network for relative camera pose estimation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11129 LNCS, pp. 738–745). Springer Verlag. https://doi.org/10.1007/978-3-030-11009-3_46
