Optimistic and Topological Value Iteration for Simple Stochastic Games

Muqsit Azeem; Alexandros Evangelidis; Jan Křetínský; Alexander Slivinskiy; Maximilian Weininger

Conference Proceedings

Optimistic and Topological Value Iteration for Simple Stochastic Games

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2022) 13505 LNCS 285-302

DOI: 10.1007/978-3-031-19992-9_18

1Citations

5Readers

Get full text

Abstract

While value iteration (VI) is a standard solution approach to simple stochastic games (SSGs), it suffered from the lack of a stopping criterion. Recently, several solutions have appeared, among them also “optimistic” VI (OVI). However, OVI is applicable only to one-player SSGs with no end components. We lift these two assumptions, making it available to general SSGs. Further, we utilize the idea in the context of topological VI, where we provide an efficient precise solution. In order to compare the new algorithms with the state of the art, we use not only the standard benchmarks, but we also design a random generator of SSGs, which can be biased towards various types of models, aiding in understanding the advantages of different algorithms on SSGs.

Cite

CITATION STYLE

APA

Azeem, M., Evangelidis, A., Křetínský, J., Slivinskiy, A., & Weininger, M. (2022). Optimistic and Topological Value Iteration for Simple Stochastic Games. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13505 LNCS, pp. 285–302). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-19992-9_18

Optimistic and Topological Value Iteration for Simple Stochastic Games

Abstract

Cite

Register to see more suggestions