On the policy iteration algorithm for nondegenerate controlled diffusions under the ergodic criterion

Ari Arapostathis

Book Chapter

On the policy iteration algorithm for nondegenerate controlled diffusions under the ergodic criterion

Arapostathis A

Birkhauser, (2012), 1-12

DOI: 10.1007/978-0-8176-8337-5_1

5Citations

2Readers

Get full text

Abstract

The subject of this chapter is the policy iteration algorithm for nondegenerate controlled diffusions. The results parallel the ones in Meyn (IEEE Trans Automat Control 42:1663–1680, 1997) for discrete-time controlled Markov chains. The model in (Meyn, IEEE Trans Automat Control 42:1663–1680, 1997) uses norm-like running costs, while we opt for the milder assumption of near-monotone costs. Also, instead of employing a blanket Lyapunov stability hypothesis, we provide a characterization of the region of attraction of the optimal control.

Cite

CITATION STYLE

APA

Arapostathis, A. (2012). On the policy iteration algorithm for nondegenerate controlled diffusions under the ergodic criterion. In Systems and Control: Foundations and Applications (pp. 1–12). Birkhauser. https://doi.org/10.1007/978-0-8176-8337-5_1

On the policy iteration algorithm for nondegenerate controlled diffusions under the ergodic criterion

Abstract

Cite

Register to see more suggestions