Imitation learning from video by leveraging proprioception

Abstract

Classically, imitation learning algorithms have been developed for idealized situations, e.g., the demonstrations are often required to be collected in the exact same environment and usually include the demonstrator's actions. Recently, however, the research community has begun to address some of these shortcomings by offering algorithmic solutions that enable imitation learning from observation (IfO), e.g., learning to perform a task from visual demonstrations that may be in a different environment and do not include actions. Motivated by the fact that agents often also have access to their own internal states (i.e., proprioception), we propose and study an IfO algorithm that leverages this information in the policy learning process. The proposed architecture learns policies over proprioceptive state representations and compares the resulting trajectories visually to the demonstration data. We experimentally test the proposed technique on several MuJoCo domains and show that it outperforms other imitation from observation algorithms by a large margin.
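
The architecture described in the abstract lends itself to a compact sketch: the policy consumes only the agent's proprioceptive state, while a GAN-style discriminator sees only stacked video frames of the resulting behavior and of the demonstration, and its output is converted into a reward for policy learning. The PyTorch code below is an illustrative reconstruction under stated assumptions, not the authors' implementation; the network sizes, two-frame stacks, 64x64 frames, and the log-D reward are all assumptions.

import torch
import torch.nn as nn

class ProprioceptivePolicy(nn.Module):
    """Gaussian policy over the agent's internal (joint/velocity) state."""
    def __init__(self, state_dim, action_dim, hidden=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.Tanh(),
            nn.Linear(hidden, hidden), nn.Tanh(),
            nn.Linear(hidden, action_dim),
        )
        self.log_std = nn.Parameter(torch.zeros(action_dim))

    def forward(self, state):
        # Action distribution conditioned on proprioception only.
        return torch.distributions.Normal(self.net(state), self.log_std.exp())

class VisualDiscriminator(nn.Module):
    """Classifies stacked consecutive frames as demonstrator vs. imitator."""
    def __init__(self, frames=2, size=64):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(3 * frames, 32, kernel_size=8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=4, stride=2), nn.ReLU(),
            nn.Flatten(),
        )
        with torch.no_grad():  # infer the flattened feature size once
            flat = self.conv(torch.zeros(1, 3 * frames, size, size)).shape[1]
        self.head = nn.Linear(flat, 1)

    def forward(self, frame_stack):
        return self.head(self.conv(frame_stack))  # unnormalized logits

def discriminator_loss(disc, demo_frames, agent_frames):
    # Standard binary cross-entropy: demo transitions -> 1, agent -> 0.
    bce = nn.functional.binary_cross_entropy_with_logits
    return (bce(disc(demo_frames), torch.ones(len(demo_frames), 1)) +
            bce(disc(agent_frames), torch.zeros(len(agent_frames), 1)))

def imitation_reward(disc, agent_frames):
    # Reward frame transitions the discriminator mistakes for the
    # demonstrator's (one common GAIL-style reward choice).
    with torch.no_grad():
        return torch.sigmoid(disc(agent_frames)).log().squeeze(-1)

In training, one would alternate discriminator updates with policy updates, feeding imitation_reward to an on-policy policy-gradient method. The key point the sketch captures is the split the abstract describes: the policy observes proprioception, while trajectories are compared visually.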

Cite

APA

Torabi, F., Warnell, G., & Stone, P. (2019). Imitation learning from video by leveraging proprioception. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI-19) (pp. 3585–3591). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2019/497
