Palamedes is an ongoing project for building expert playing bots that can play backgammon variants. As in all successful modern backgammon programs, it is based on neural networks trained using temporal difference learning. This paper improves upon the training method that we used in our previous approach for the two backgammon variants popular in Greece and neighboring countries, Plakoto and Fevga. We show that the proposed methods result both in faster learning as well as better performance. We also present insights into the selection of the features in our experiments that can be useful to temporal difference learning in other games as well. © 2012 Springer-Verlag.
CITATION STYLE
Papahristou, N., & Refanidis, I. (2012). Improving temporal difference learning performance in backgammon variants. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7168 LNCS, pp. 134–145). https://doi.org/10.1007/978-3-642-31866-5_12
Mendeley helps you to discover research relevant for your work.