We propose an actor-critic algorithm that uses past planning experience to improve the efficiency of solving robot task-and-motion planning (TAMP) problems. TAMP planners search for goal-achieving sequences of high-level operator instances specified by both discrete and continuous parameters. Our algorithm learns a policy for selecting the continuous parameters during search, using a small training set generated from the search trees of previously solved instances. We also introduce a novel fixed-length vector representation for world states containing varying numbers of objects of different shapes, based on a set of key robot configurations. We demonstrate experimentally that our method learns more efficiently from less data than standard reinforcement-learning approaches, and that using the learned policy to guide a planner improves planning efficiency.
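The fixed-length state representation mentioned above could, in rough outline, work as follows: evaluate each of a fixed set of key robot configurations against the current world, producing one feature per configuration regardless of how many objects are present. The sketch below is a minimal illustration under strong simplifying assumptions (point robot, disc-shaped objects, binary collision features); the key configurations, helper names, and collision test are hypothetical, not the paper's actual implementation.

```python
import numpy as np

# Hypothetical fixed set of "key" robot configurations (2-D for illustration).
# The encoding length equals the number of key configurations, independent of
# how many objects the world contains.
KEY_CONFIGS = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])

def in_collision(config, obj_center, obj_radius):
    """Illustrative test: point robot vs. disc-shaped object."""
    return np.linalg.norm(config - np.asarray(obj_center)) <= obj_radius

def encode_state(objects):
    """Encode a world given as a list of (center, radius) objects.

    Returns a fixed-length 0/1 vector: entry i is 1 if the robot placed at
    key configuration i would collide with any object.
    """
    vec = np.zeros(len(KEY_CONFIGS))
    for i, q in enumerate(KEY_CONFIGS):
        if any(in_collision(q, c, r) for c, r in objects):
            vec[i] = 1.0
    return vec

# Worlds with different numbers of objects map to vectors of the same length.
print(encode_state([([0.0, 0.0], 0.5)]))
print(encode_state([([1.0, 0.1], 0.3), ([0.0, 1.0], 0.2)]))
```

Because the vector length depends only on the number of key configurations, the same policy network can be applied to scenes with any number of objects, which is the property the representation is designed to provide.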
Kim, B., Kaelbling, L. P., & Lozano-Pérez, T. (2019). Adversarial actor-critic method for task and motion planning problems using planning experience. In 33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Innovative Applications of Artificial Intelligence Conference, IAAI 2019 and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019 (pp. 8017–8024). AAAI Press. https://doi.org/10.1609/aaai.v33i01.33018017