We consider average Markov decision processes on countable state space, compact action space and with unbounded costs. Under a certain penalizing condition on the cost for unstable behavior, we establish the existence of a stable stationary strategy which is strong average optimal. © 1992.
Mendeley saves you time finding and organizing research
Choose a citation style from the tabs below