Learning a simulation-based visual policy for real-world peg in unseen holes

2Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.

Abstract

This paper proposes a learning-based visual peg-in-hole that enables training with several shapes in simulation and adapting to arbitrary unseen shapes in the real world with minimal sim-to-real cost. The core idea is to decouple the generalization of the sensory-motor policy from the design of a fast-adaptable perception module and a simulated generic policy module. The framework consists of a segmentation network (SN), a virtual sensor network (VSN), and a controller network (CN). Concretely, the VSN is trained to measure the pose of the unseen shape from a segmented image. After that, given the shape-agnostic pose measurement, the CN is trained to achieve a generic peg-in-hole. Finally, when applying to real unseen holes, we only have to fine-tune the SN required by the simulated VSN + CN. To further minimize the transfer cost, we propose to automatically collect and annotate the data for the SN after one-minute human teaching. Simulated and real-world results are presented under the configuration of eye-to/in-hand. An electric vehicle charging system with the proposed policy inside achieves a 10/10 success rate in 2-3 s, using only hundreds of auto-labeled samples for the SN transfer.

Cite

CITATION STYLE

APA

Xie, L., Yu, H., Xu, K., Yang, T., Wang, M., Lu, H., … Wang, Y. (2023). Learning a simulation-based visual policy for real-world peg in unseen holes. Review of Scientific Instruments, 94(10). https://doi.org/10.1063/5.0168544

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free