Constructing skill trees for reinforcement learning agents from demonstration trajectories

George Konidaris; Scott Kuindersma; Andrew Barto; Roderic Grupen

Conference Proceedings

Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010, NIPS 2010 (2010)

Abstract

We introduce CST, an algorithm for constructing skill trees from demonstration trajectories in continuous reinforcement learning domains. CST uses a changepoint detection method to segment each trajectory into a skill chain, placing a boundary where the appropriate abstraction changes or where a segment is too complex to model as a single skill. The skill chains from each trajectory are then merged to form a skill tree. We demonstrate that CST constructs an appropriate skill tree that can be further refined through learning in a challenging continuous domain, and that it can be used to segment demonstration trajectories on a mobile manipulator into chains of skills where each skill is assigned an appropriate abstraction.
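
The two steps named in the abstract, segmenting a trajectory at changepoints and merging the resulting chains into a tree, can be illustrated with a minimal Python sketch. It is not the paper's method: the detector below is a swapped-in penalized least-squares segmentation with piecewise-constant models, and the merge simply compares skill labels, assuming every chain ends at a shared goal so that chains are merged from their final skill backward. The names `segment`, `merge_chains`, `penalty`, and the skill labels are hypothetical.

```python
from itertools import accumulate


def segment(values, penalty):
    """Split a 1-D signal into segments; return each segment's start index.

    Stand-in detector (an assumption, not the paper's): dynamic programming
    that minimizes squared error of a constant fit per segment plus a fixed
    penalty for every segment.
    """
    n = len(values)
    s1 = [0.0] + list(accumulate(values))                 # prefix sums
    s2 = [0.0] + list(accumulate(v * v for v in values))  # prefix sums of squares

    def sse(i, j):
        # Squared error of fitting one constant (the mean) to values[i:j].
        total = s1[j] - s1[i]
        return (s2[j] - s2[i]) - total * total / (j - i)

    best = [0.0] + [float("inf")] * n  # best[j]: minimum cost of values[:j]
    prev = [0] * (n + 1)               # prev[j]: start of the last segment
    for j in range(1, n + 1):
        for i in range(j):
            cost = best[i] + sse(i, j) + penalty
            if cost < best[j]:
                best[j], prev[j] = cost, i

    # Walk back through the stored segment starts.
    starts, j = [], n
    while j > 0:
        j = prev[j]
        starts.append(j)
    return starts[::-1]


def merge_chains(chains):
    """Merge skill chains into a nested-dict tree rooted at the final skill.

    Assumption: skills are identified by hypothetical string labels, and two
    skills are merged only when their labels and all later skills match.
    """
    tree = {}
    for chain in chains:
        node = tree
        for skill in reversed(chain):
            node = node.setdefault(skill, {})
    return tree


# A signal whose level changes once, and two chains sharing their last skills.
signal = [0, 0, 0, 0, 5, 5, 5, 5]
boundaries = segment(signal, penalty=1.0)
tree = merge_chains([
    ["open_door", "walk", "press_button"],
    ["climb", "walk", "press_button"],
])
```

A larger `penalty` yields fewer, longer segments, which loosely mirrors the trade-off between keeping one skill and splitting a segment that is too complex for a single model.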

Cite

Konidaris, G., Kuindersma, S., Barto, A., & Grupen, R. (2010). Constructing skill trees for reinforcement learning agents from demonstration trajectories. In Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010, NIPS 2010. Neural Information Processing Systems.
