DexMV: Imitation Learning for Dexterous Manipulation from Human Videos

Yuzhe Qin; Yueh Hua Wu; Shaowei Liu; Hanwen Jiang; Ruihan Yang; Yang Fu; Xiaolong Wang

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 13699 LNCS, pp. 570–587 (2022)

DOI: 10.1007/978-3-031-19842-7_33

13 Citations

74 Readers

Abstract

While significant progress has been made on understanding hand-object interactions in computer vision, it is still very challenging for robots to perform complex dexterous manipulation. In this paper, we propose a new platform and pipeline, DexMV (Dexterous Manipulation from Videos), for imitation learning. We design a platform with: (i) a simulation system for complex dexterous manipulation tasks with a multi-finger robot hand and (ii) a computer vision system to record large-scale demonstrations of a human hand conducting the same tasks. In our pipeline, we extract 3D hand and object poses from videos and propose a novel demonstration translation method to convert human motion to robot demonstrations. We then apply and benchmark multiple imitation learning algorithms with the demonstrations. We show that the demonstrations can indeed improve robot learning by a large margin and solve complex tasks that reinforcement learning alone cannot solve. Code and videos are available at https://yzqin.github.io/dexmv

Cite

Qin, Y., Wu, Y. H., Liu, S., Jiang, H., Yang, R., Fu, Y., & Wang, X. (2022). DexMV: Imitation Learning for Dexterous Manipulation from Human Videos. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13699 LNCS, pp. 570–587). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-19842-7_33
