Risk Sensitive Stochastic Shortest Path and LogSumExp: From Theory to Practice

0Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Stochastic Shortest Path (SSP) is the most popular framework to model sequential decision-making problems under stochasticity. However, decisions for real problems should consider risk sensitivity to provide robust decisions taking into account bad scenarios. SSPs that deal with risk are called Risk-Sensitive SSPs (RSSSPs), and an interesting framework from a theoretical perspective considers Expected Utility Theory under an exponential utility function. However, from a practical perspective, exponential utility function causes overflow or underflow in computer implementation even in small state spaces. In this paper, we make use of LogSumExp technique to solve RSSSPs under exponential utility in practice within Value Iteration, Policy Iteration, and Linear Programming algorithms. Experiments were performed on a toy problem to show scalability of the proposed algorithms.

Cite

CITATION STYLE

APA

de Freitas, E. M., Freire, V., & Delgado, K. V. (2020). Risk Sensitive Stochastic Shortest Path and LogSumExp: From Theory to Practice. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12320 LNAI, pp. 123–139). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-61380-8_9

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free