This paper focuses on 6DoF object pose estimation from a single RGB image. We tackle this challenging problem with a two-stage optimization framework. Specifically, we first introduce a translation estimation module that provides an initial translation based on an estimated depth map. A pose regression module then combines the resulting ROI (Region of Interest) with the original image to predict the rotation and refine the translation. Compared with previous end-to-end methods that directly predict rotations and translations, our method uses depth information as weak guidance and significantly reduces the search space for the subsequent module. Furthermore, we design a new loss function for symmetric objects, which remain exceptionally difficult cases that prior works have struggled to handle. Experiments show that our model achieves state-of-the-art object pose estimation on the YCB-Video (Yale-CMU-Berkeley) dataset.
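To make the two-stage pipeline concrete, below is a minimal PyTorch-style sketch of the inference flow plus a symmetric-object loss. The module names, layer choices, quaternion parameterization, and the ADD-S-style closest-point distance are illustrative assumptions for exposition, not the paper's actual architecture or loss.

```python
# Minimal sketch of a two-stage pose pipeline as described in the abstract.
# All names, shapes, and the symmetric distance are assumptions, not the
# authors' exact implementation.
import torch
import torch.nn as nn


class TranslationEstimator(nn.Module):
    """Stage 1 (assumed interface): predict a depth map from the RGB image
    and derive an initial translation."""
    def __init__(self):
        super().__init__()
        self.depth_net = nn.Sequential(          # placeholder backbone
            nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
            nn.Conv2d(16, 1, 3, padding=1),
        )

    def forward(self, rgb):
        depth = self.depth_net(rgb)              # (B, 1, H, W)
        # Toy initial translation: zero (x, y) offset, mean depth as z.
        zeros = torch.zeros(rgb.size(0), device=rgb.device)
        t_init = torch.stack([zeros, zeros, depth.mean(dim=(1, 2, 3))], dim=1)
        return depth, t_init                     # t_init: (B, 3)


class PoseRegressor(nn.Module):
    """Stage 2 (assumed interface): regress a rotation (quaternion) and a
    translation refinement from the ROI crop plus the full image.
    The ROI crop is assumed to be resized to the full-image resolution."""
    def __init__(self):
        super().__init__()
        self.head = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(6, 7),                     # 4 (quaternion) + 3 (delta t)
        )

    def forward(self, roi, rgb):
        feat = torch.cat([roi, rgb], dim=1)      # (B, 6, H, W)
        out = self.head(feat)
        quat = nn.functional.normalize(out[:, :4], dim=1)
        delta_t = out[:, 4:]
        return quat, delta_t


def symmetric_distance_loss(pred_pts, gt_pts):
    """ADD-S-style loss commonly used for symmetric objects: for each
    transformed model point, take the distance to the closest ground-truth
    point. A common choice, not necessarily the paper's exact loss."""
    d = torch.cdist(pred_pts, gt_pts)            # (B, N, M) pairwise distances
    return d.min(dim=2).values.mean()
```

Under these assumptions, inference runs stage 1 to obtain the depth map and initial translation, crops an ROI around the object implied by that translation, and then runs stage 2 on the ROI and the full image; the stage-1 output is what narrows the search space for the rotation and translation regression in stage 2.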
Jin, L., Wang, X., He, M., & Wang, J. (2021). Drnet: A depth-based regression network for 6d object pose estimation. Sensors, 21(5), 1–15. https://doi.org/10.3390/s21051692