Markovian Decision Processes with Compact Action Spaces

  • Furukawa N

Abstract

We consider the problem of maximizing the expected total discounted reward in Markovian decision processes with an arbitrary state space and a compact action space that varies with the state. We obtain an existence theorem for a (p, ε)-optimal stationary policy and establish the relation between the optimality of a policy and the optimality equation. Under the assumption that the action space is a compact subset of n-dimensional Euclidean space, we establish the existence of an optimal stationary policy and obtain an algorithm for finding it. These last two results rest on the Borel implicit function lemma given in this paper. Copyright © 1972 Institute of Mathematical Statistics
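As a rough illustration of the optimality equation mentioned in the abstract, the sketch below runs standard value iteration on a toy problem. It is not the algorithm of the paper: it assumes a finite state space, one-dimensional action intervals A(s) = [lo, hi] that vary with the state, and a grid discretization of each interval. The names `value_iteration`, `action_bounds`, `reward`, `transition`, and `beta`, along with the toy reward and transition functions, are hypothetical and chosen only for this example.

```python
import numpy as np

def value_iteration(n_states, action_bounds, reward, transition,
                    beta=0.9, n_grid=51, tol=1e-8, max_iter=10_000):
    """Approximate the optimal discounted value on a finite state space.

    action_bounds[s] = (lo, hi) is the compact action interval A(s),
    replaced here by an evenly spaced grid (an assumption of this sketch).
    reward(s, a) returns a float; transition(s, a) returns a probability
    vector of length n_states.
    """
    grids = [np.linspace(lo, hi, n_grid) for lo, hi in action_bounds]

    def q_values(v, s):
        # Right-hand side of the optimality equation for each grid action.
        return np.array([reward(s, a) + beta * (transition(s, a) @ v)
                         for a in grids[s]])

    v = np.zeros(n_states)
    for _ in range(max_iter):
        v_new = np.array([q_values(v, s).max() for s in range(n_states)])
        converged = np.max(np.abs(v_new - v)) < tol
        v = v_new
        if converged:
            break

    # Stationary policy that is greedy with respect to the final values.
    policy = np.array([grids[s][np.argmax(q_values(v, s))]
                       for s in range(n_states)])
    return v, policy

# Toy problem: two states, action sets that depend on the state.
action_bounds = [(0.0, 1.0), (0.0, 0.5)]

def reward(s, a):
    # Reward peaks at a = 0.3 in both states; state 1 pays one unit more.
    return s - (a - 0.3) ** 2

def transition(s, a):
    # Next state is uniform regardless of the action (keeps the toy solvable by hand).
    return np.array([0.5, 0.5])

v, policy = value_iteration(2, action_bounds, reward, transition, beta=0.9)
```

In this toy problem the fixed point can be checked by hand: with the best action 0.3 in each state, v(s) = s + 0.9 · (v(0) + v(1)) / 2, which gives v(0) = 4.5 and v(1) = 5.5.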

Cite

Furukawa, N. (1972). Markovian Decision Processes with Compact Action Spaces. The Annals of Mathematical Statistics, 43(5), 1612–1622. https://doi.org/10.1214/aoms/1177692393
