Improving temporal difference learning performance in backgammon variants

Nikolaos Papahristou; Ioannis Refanidis

Conference Proceedings

Improving temporal difference learning performance in backgammon variants

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7168 LNCS 134-145

DOI: 10.1007/978-3-642-31866-5_12

4Citations

3Readers

Get full text

Abstract

Palamedes is an ongoing project for building expert playing bots that can play backgammon variants. As in all successful modern backgammon programs, it is based on neural networks trained using temporal difference learning. This paper improves upon the training method that we used in our previous approach for the two backgammon variants popular in Greece and neighboring countries, Plakoto and Fevga. We show that the proposed methods result both in faster learning as well as better performance. We also present insights into the selection of the features in our experiments that can be useful to temporal difference learning in other games as well. © 2012 Springer-Verlag.

Cite

CITATION STYLE

APA

Papahristou, N., & Refanidis, I. (2012). Improving temporal difference learning performance in backgammon variants. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7168 LNCS, pp. 134–145). https://doi.org/10.1007/978-3-642-31866-5_12

Improving temporal difference learning performance in backgammon variants

Abstract

Cite

Register to see more suggestions