Analysis and optimization of deep counterfactual value networks

Patryk Hopner; Eneldo Loza Mencía

Conference Proceedings

Analysis and optimization of deep counterfactual value networks

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 11117 LNAI 305-312

DOI: 10.1007/978-3-030-00111-7_26

N/ACitations

6Readers

Get full text

Abstract

Recently a strong poker-playing algorithm called DeepStack was published, which is able to find an approximate Nash equilibrium during gameplay by using heuristic values of future states predicted by deep neural networks. This paper analyzes new ways of encoding the inputs and outputs of DeepStack’s deep counterfactual value networks based on traditional abstraction techniques, as well as an unabstracted encoding, which was able to increase the network’s accuracy.

Author supplied keywords

Cite

CITATION STYLE

APA

Hopner, P., & Loza Mencía, E. (2018). Analysis and optimization of deep counterfactual value networks. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11117 LNAI, pp. 305–312). Springer Verlag. https://doi.org/10.1007/978-3-030-00111-7_26

Analysis and optimization of deep counterfactual value networks

Abstract

Author supplied keywords

Cite

Register to see more suggestions