Imitation learning using graphical models

Deepak Verma; Rajesh P.N. Rao

Conference ProceedingsOPEN ACCESS

Imitation learning using graphical models

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2007) 4701 LNAI 757-764

DOI: 10.1007/978-3-540-74958-5_77

8Citations

12Readers

Abstract

Imitation-based learning is a general mechanism for rapid acquisition of new behaviors in autonomous agents and robots. In this paper, we propose a new approach to learning by imitation based on parameter learning in probabilistic graphical models. Graphical models are used not only to model an agent's own dynamics but also the dynamics of an observed teacher. Parameter tying between the agent-teacher models ensures consistency and facilitates learning. Given only observations of the teacher's states, we use the expectation-maximization (EM) algorithm to learn both dynamics and policies within graphical models. We present results demonstrating that EM-based imitation learning out performs pure exploration-based learning on a benchmark problem (the Flag World domain). We additionally show that the graphical model representation can be leveraged to incorporate domain knowledge (e.g., state space factoring) to achieve significant speed-up in learning. © Springer-Verlag Berlin Heidelberg 2007.

Cite

CITATION STYLE

APA

Verma, D., & Rao, R. P. N. (2007). Imitation learning using graphical models. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4701 LNAI, pp. 757–764). Springer Verlag. https://doi.org/10.1007/978-3-540-74958-5_77

Imitation learning using graphical models

Abstract

Cite

Register to see more suggestions