On time with minimal expected cost!

Alexandre David; Peter G. Jensen; Kim Guldstrand Larsen; Axel Legay; Didier Lime; Mathias Grund Soørensen; Jakob H. Taankvist

Conference Proceedings

On time with minimal expected cost!

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2014) 8837 129-145

DOI: 10.1007/978-3-319-11936-6_10

45Citations

10Readers

Get full text

Abstract

(Priced) timed games are two-player quantitative games involving an environment assumed to be completely antogonistic. Classical analysis consists in the synthesis of strategies ensuring safety, time-bounded or cost-bounded reachability objectives. Assuming a randomized environment, the (priced) timed game essentially defines an infinite-state Markov (reward) decision proces. In this setting the objective is classically to find a strategy that will minimize the expected reachability cost, but with no guarantees on worst-case behaviour. In this paper, we provide efficient methods for computing reachability strategies that will both ensure worst case time-bounds as well as provide (near-) minimal expected cost. Our method extends the synthesis algorithms of the synthesis tool Uppaal-Tiga with suitable adapted reinforcement learning techniques, that exhibits several orders of magnitude improvements w.r.t. previously known automated methods.

Cite

CITATION STYLE

APA

David, A., Jensen, P. G., Larsen, K. G., Legay, A., Lime, D., Soørensen, M. G., & Taankvist, J. H. (2014). On time with minimal expected cost! In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8837, pp. 129–145). Springer Verlag. https://doi.org/10.1007/978-3-319-11936-6_10

On time with minimal expected cost!

Abstract

Cite

Register to see more suggestions