This paper describes the design and implementation of a 4 player RoboCup Simulation 2D team, which was build by adding Heuristic: Accelerated Reinforcement Learning capabilities to basic players of the well-known UvA Trilearn team. The implemented agents learn by using a recently proposed Heuristic Heinforcement Learning algorithm, the Heuristically Accelerated Q-Learning (HAQL), which allows the use of heuristics to speed up the well-known Reinforcement Learning algorithm Q-Learning. A set of empirical evaluations was conducted in the RoboCup 2D Simulator, and experimental results obtained while playing with other teams shows that the approach adopted hero is very promising. © Springer-Verlag Berlin Heidelberg 2007.
CITATION STYLE
Celiberto, L. A., Matsmura, J., & Bianchi, R. A. C. (2007). Heuristic Q-learning soccer players: A new reinforcement learning approach to RoboCup simulation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4874 LNAI, pp. 520–529). Springer Verlag. https://doi.org/10.1007/978-3-540-77002-2_44
Mendeley helps you to discover research relevant for your work.