Contractivity of Bellman operator in risk averse dynamic programming with infinite horizon

Abstract

The paper deals with a risk-averse dynamic programming problem with an infinite horizon. First, the assumptions required for the problem to be well defined are formulated. Then the Bellman equation is derived, which may also be viewed as a standalone reinforcement learning problem. It is proved that the Bellman operator is a contraction, which guarantees the convergence of various solution algorithms used for dynamic programming as well as reinforcement learning; this is demonstrated on the value iteration and the policy iteration algorithms.
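To see why contractivity matters, recall that if the Bellman operator T satisfies ||TV - TW||_inf <= gamma * ||V - W||_inf for a discount factor gamma < 1, then by the Banach fixed-point theorem value iteration converges geometrically to the unique fixed point. The sketch below illustrates this numerically for one concrete risk-averse operator, using the entropic risk measure rho(X) = (1/theta) * log E[exp(theta X)] on a small random finite MDP. Both of these modeling choices are illustrative assumptions, not the paper's specific setting; the entropic certainty equivalent is monotone and translation invariant, which is what makes the resulting operator a sup-norm gamma-contraction.

```python
import numpy as np

# Minimal sketch (illustrative, not the paper's exact setting): value
# iteration for a risk-averse Bellman operator on a small finite MDP,
# with the entropic risk measure applied to discounted continuation costs.
# All sizes and parameters below are arbitrary assumptions.

n_states, n_actions = 3, 2
gamma, theta = 0.9, 1.0          # discount factor, risk-aversion parameter
rng = np.random.default_rng(0)

# Random transition kernel P[a, s, s'] and stage cost c[a, s].
P = rng.random((n_actions, n_states, n_states))
P /= P.sum(axis=2, keepdims=True)
c = rng.random((n_actions, n_states))

def entropic(P_a, V):
    """Entropic risk of the continuation value V, per state, under P_a."""
    return np.log(P_a @ np.exp(theta * V)) / theta

def bellman(V):
    """Risk-averse Bellman operator: minimize stage cost + discounted risk."""
    Q = np.array([c[a] + gamma * entropic(P[a], V) for a in range(n_actions)])
    return Q.min(axis=0)

V = np.zeros(n_states)
for k in range(200):
    V_new = bellman(V)
    gap = np.max(np.abs(V_new - V))   # shrinks at rate <= gamma (contraction)
    V = V_new
    if gap < 1e-10:
        break
print(f"converged after {k + 1} iterations, V = {V}")
```

The same loop works unchanged for any risk mapping that is monotone and translation invariant, since those two properties are essentially what the standard sup-norm contraction argument requires.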

Citation (APA)

Kopa, M., & Šmíd, M. (2023). Contractivity of Bellman operator in risk averse dynamic programming with infinite horizon. Operations Research Letters, 51(2), 133–136. https://doi.org/10.1016/j.orl.2023.01.008
