In this chapter we study Markov decision processes (MDPs) with finite state and action spaces. This is the classical theory developed since the late 1950s. We consider both finite and infinite horizon models. For the finite horizon model, the utility function of the total expected reward is commonly used. For the infinite horizon the choice of utility function is less obvious, and we consider several criteria: total discounted expected reward, average expected reward, and more sensitive optimality criteria, including Blackwell optimality. We end with a variety of other subjects. The emphasis is on computational methods for finding optimal policies under these criteria; these methods are based on concepts such as value iteration, policy iteration, and linear programming. This survey covers about three hundred papers. Although the subject of finite state and action MDPs is classical, open problems remain, and we mention some of them.
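To make the discounted-reward setting concrete, the following is a minimal value-iteration sketch for a tiny finite MDP. All transition probabilities, rewards, and the discount factor below are invented for illustration; the chapter itself treats these methods in full generality.

```python
import numpy as np

# Illustrative 2-state, 2-action MDP (all numbers are invented):
# P[a][s][s'] = transition probability, R[a][s] = expected one-step reward.
P = np.array([
    [[0.9, 0.1], [0.2, 0.8]],   # action 0
    [[0.5, 0.5], [0.6, 0.4]],   # action 1
])
R = np.array([
    [1.0, 0.0],                 # action 0
    [0.5, 2.0],                 # action 1
])
gamma = 0.9                     # discount factor, 0 <= gamma < 1

def value_iteration(P, R, gamma, tol=1e-10):
    """Iterate the Bellman optimality operator
    V <- max_a [ R(a) + gamma * P(a) V ] until convergence."""
    n_states = P.shape[1]
    V = np.zeros(n_states)
    while True:
        Q = R + gamma * (P @ V)        # Q[a, s], batched matrix-vector product
        V_new = Q.max(axis=0)
        if np.max(np.abs(V_new - V)) < tol:
            break
        V = V_new
    policy = Q.argmax(axis=0)          # a greedy (hence optimal) stationary policy
    return V, policy

V, policy = value_iteration(P, R, gamma)
print("values:", V, "policy:", policy)
```

Because the Bellman operator is a contraction with modulus `gamma`, the iteration converges geometrically to the optimal value vector, and any greedy policy with respect to it is optimal for the discounted criterion.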
Kallenberg, L. (2003). Finite State and Action MDPs (pp. 21–87). https://doi.org/10.1007/978-1-4615-0805-2_2