We study continuous-time controlled Markov chains on the finite horizon. For the Markov decision problem, we show that the value function is the unique solution of the corresponding dynamic programming equation. This leads to the existence of an optimal Markov control. We then consider a zero-sum game. We show that the value function exists and is the unique solution of the corresponding Isaacs equations. This yields the existence of a pair of saddle point Markov strategies.
CITATION STYLE
Ghosh, M. K., & Saha, S. (2012). Continuous-time controlled jump Markov processes on the finite horizon. In Systems and Control: Foundations and Applications (pp. 99–109). Birkhauser. https://doi.org/10.1007/978-0-8176-8337-5_6
Mendeley helps you to discover research relevant for your work.