Concurrent hierarchical reinforcement learning for RoboCup Keepaway

Aijun Bai; Stuart Russell; Xiaoping Chen

Conference Proceedings, Open Access


Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 11175 LNAI 190-203

DOI: 10.1007/978-3-030-00308-1_16


Abstract

RoboCup Keepaway, which originated from the RoboCup soccer simulation 2D challenge, has been widely used as a machine learning benchmark. In this paper, we present a concurrent hierarchical reinforcement learning approach to RoboCup Keepaway. Following the idea of hierarchies of abstract machines (HAMs), we write a partial policy as a HAM from the perspective of a single keeper, run multiple instances of the HAM, and use reinforcement learning to learn the optimal completion of the resulting joint HAM. Furthermore, we exploit the internal transitions within the HAM structure for more efficient learning. Experimental results confirm that the concurrent HAM approaches significantly outperform the state of the art on the highly complex RoboCup Keepaway domain.
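To make the idea of "learning the completion of a partial policy" concrete, here is a minimal single-agent sketch in Python. It is not the authors' implementation and does not model Keepaway or the concurrent joint HAM. It assumes a toy corridor world, and every name in it (`SUBMACHINES`, `run_submachine`, `learn`, `greedy_choice`) is hypothetical. The fixed sub-machines stand in for the hand-written part of a HAM, and SMDP-style Q-learning fills in only the decision made at the choice point.

```python
import random

# Toy corridor (an assumption for illustration): start at cell 0, end at GOAL.
GOAL = 6

def primitive_step(pos, move):
    """Apply one primitive move (+1 or -1); every primitive step costs -1."""
    new_pos = max(0, min(GOAL, pos + move))
    return new_pos, -1.0

# Hypothetical sub-machines: each is a fixed sequence of primitive moves.
# They play the role of the hand-written partial policy; only the choice
# between them is learned.
SUBMACHINES = {
    "advance": [+1, +1],
    "retreat": [-1],
}

def run_submachine(pos, name, gamma):
    """Run a sub-machine until it finishes or the goal is reached.

    Returns (new position, discounted reward accumulated, primitive steps).
    """
    total, discount, steps = 0.0, 1.0, 0
    for move in SUBMACHINES[name]:
        pos, reward = primitive_step(pos, move)
        total += discount * reward
        discount *= gamma
        steps += 1
        if pos == GOAL:
            break
    return pos, total, steps

def learn(episodes=500, alpha=0.2, gamma=0.95, epsilon=0.1, seed=0):
    """SMDP Q-learning over (position, sub-machine) pairs at the choice point."""
    rng = random.Random(seed)
    names = sorted(SUBMACHINES)
    q = {(pos, name): 0.0 for pos in range(GOAL + 1) for name in names}
    for _ in range(episodes):
        pos = 0
        while pos != GOAL:
            # Epsilon-greedy selection at the choice point.
            if rng.random() < epsilon:
                choice = rng.choice(names)
            else:
                choice = max(names, key=lambda n: q[(pos, n)])
            new_pos, reward, steps = run_submachine(pos, choice, gamma)
            # Discount the bootstrap value by the number of primitive steps taken.
            future = 0.0 if new_pos == GOAL else max(q[(new_pos, n)] for n in names)
            target = reward + (gamma ** steps) * future
            q[(pos, choice)] += alpha * (target - q[(pos, choice)])
            pos = new_pos
    return q

def greedy_choice(q, pos):
    """Return the sub-machine with the highest learned value at this position."""
    return max(sorted(SUBMACHINES), key=lambda n: q[(pos, n)])
```

Calling `q = learn()` and then `greedy_choice(q, 0)` shows which sub-machine the learned completion prefers at the start cell. In the concurrent setting described in the abstract, several such machine instances would run at once and the learner would act on their joint choice points, which this sketch leaves out.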


Citation (APA)

Bai, A., Russell, S., & Chen, X. (2018). Concurrent hierarchical reinforcement learning for RoboCup Keepaway. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11175 LNAI, pp. 190–203). Springer Verlag. https://doi.org/10.1007/978-3-030-00308-1_16
