Improving temporal difference learning performance in backgammon variants


Abstract

Palamedes is an ongoing project for building expert-level bots that play backgammon variants. Like all successful modern backgammon programs, it is based on neural networks trained using temporal difference learning. This paper improves upon the training method used in our previous approach for Plakoto and Fevga, two backgammon variants popular in Greece and neighboring countries. We show that the proposed methods result in both faster learning and better performance. We also present insights into the selection of features in our experiments that can be useful for temporal difference learning in other games as well. © 2012 Springer-Verlag.

Cite

Papahristou, N., & Refanidis, I. (2012). Improving temporal difference learning performance in backgammon variants. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7168 LNCS, pp. 134–145). https://doi.org/10.1007/978-3-642-31866-5_12
