Imitation-based learning is a general mechanism for rapid acquisition of new behaviors in autonomous agents and robots. In this paper, we propose a new approach to learning by imitation based on parameter learning in probabilistic graphical models. Graphical models are used not only to model an agent's own dynamics but also the dynamics of an observed teacher. Parameter tying between the agent-teacher models ensures consistency and facilitates learning. Given only observations of the teacher's states, we use the expectation-maximization (EM) algorithm to learn both dynamics and policies within graphical models. We present results demonstrating that EM-based imitation learning out performs pure exploration-based learning on a benchmark problem (the Flag World domain). We additionally show that the graphical model representation can be leveraged to incorporate domain knowledge (e.g., state space factoring) to achieve significant speed-up in learning. © Springer-Verlag Berlin Heidelberg 2007.
CITATION STYLE
Verma, D., & Rao, R. P. N. (2007). Imitation learning using graphical models. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4701 LNAI, pp. 757–764). Springer Verlag. https://doi.org/10.1007/978-3-540-74958-5_77
Mendeley helps you to discover research relevant for your work.