Multi-criteria comparison of coevolution and temporal difference learning on Othello

Wojciech Jaśkowski; Marcin Szubert; Paweł Liskowski

Conference Proceedings

Multi-criteria comparison of coevolution and temporal difference learning on Othello

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2014) 8602 301-312

DOI: 10.1007/978-3-662-45523-4_25

5Citations

7Readers

Get full text

Abstract

We compare Temporal Difference Learning (TDL) with Coevolutionary Learning (CEL) on Othello. Apart from using three popular single-criteria performance measures: (i) generalization performance or expected utility, (ii) average results against a hand-crafted heuristic and (iii) result in a head to head match, we compare the algorithms using performance profiles. This multi-criteria performance measure characterizes player’s performance in the context of opponents of various strength. The multi-criteria analysis reveals that although the generalization performance of players produced by the two algorithms is similar, TDL is much better at playing against strong opponents, while CEL copes better against weak ones. We also find out that the TDL produces less diverse strategies than CEL. Our results confirms the usefulness of performance profiles as a tool for comparison of learning algorithms for games.

Author supplied keywords

Cite

CITATION STYLE

APA

Jaśkowski, W., Szubert, M., & Liskowski, P. (2014). Multi-criteria comparison of coevolution and temporal difference learning on Othello. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8602, pp. 301–312). Springer Verlag. https://doi.org/10.1007/978-3-662-45523-4_25

Multi-criteria comparison of coevolution and temporal difference learning on Othello

Abstract

Author supplied keywords

Cite

Register to see more suggestions