Learning algorithms for differential games of continuous-time systems

Derong Liu; Qinglai Wei; Ding Wang; Xiong Yang; Hongliang Li

Book Chapter

Learning algorithms for differential games of continuous-time systems

Springer International Publishing, (2017), 417-480

DOI: 10.1007/978-3-319-50815-3_11

1Citations

3Readers

Get full text

Abstract

In this chapter, differential games are studied for continuous-time linear and nonlinear systems, including two-player zero-sum games, multi-player zero-sum games, and multi-player nonzero-sum games, via a series of adaptive dynamic programming (ADP) approaches. First, an integral policy iteration algorithm is developed to learn online the Nash equilibrium of two-player zero-sum differential games with completely unknown continuous-time linear dynamics using the state and control data. Second, multi-player zero-sum differential games for a class of continuous-time uncertain nonlinear systems are solved by using a novel iterative ADP algorithm. Via neural network modeling for the system dynamics, the ADP technique is employed to obtain the optimal control pair iteratively so that the iterative value function reaches the optimal solution of the zero-sum differential games. Finally, an online synchronous approximate optimal learning algorithm based on policy iteration is developed to solve multi-player nonzero-sum games of continuous-time nonlinear systems without the requirement of exact knowledge of system dynamics.

Cite

CITATION STYLE

APA

Liu, D., Wei, Q., Wang, D., Yang, X., & Li, H. (2017). Learning algorithms for differential games of continuous-time systems. In Advances in Industrial Control (pp. 417–480). Springer International Publishing. https://doi.org/10.1007/978-3-319-50815-3_11

Learning algorithms for differential games of continuous-time systems

Abstract

Cite

Register to see more suggestions