Reinforcement learning policy with proportional-integral control

Ye Huang; Chaochen Gu; Kaijie Wu; Xinping Guan

Conference Proceedings

Reinforcement learning policy with proportional-integral control

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 11303 LNCS 253-264

DOI: 10.1007/978-3-030-04182-3_23

1Citations

6Readers

Get full text

Abstract

Deep Reinforcement Learning has made impressive advances in sequential decision making problems recently. Constructive reinforcement learning (RL) algorithms have been proposed to focus on the policy optimization process, while further research on different network architectures of the policy has not been fully explored. MLPs, LSTMs and linear layer are complementary in their controlling capabilities, as MLPs are appropriate for global control, LSTMs are able to exploit history information and linear layer is good at stabilizing system dynamics. In this paper, we propose a “Proportional-Integral” (PI) neural network architecture that could be easily combined with popular optimization algorithms. This PI-patterned policy network obtains the advantages of integral control and linear control that are widely applied in classic control systems, improving the sample efficiency and training performance on most RL tasks. Experimental results on public RL simulation platforms demonstrate the proposed architecture could achieve better performance than generally used MLP and other existing applied models.

Author supplied keywords

Cite

CITATION STYLE

APA

Huang, Y., Gu, C., Wu, K., & Guan, X. (2018). Reinforcement learning policy with proportional-integral control. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11303 LNCS, pp. 253–264). Springer Verlag. https://doi.org/10.1007/978-3-030-04182-3_23

Reinforcement learning policy with proportional-integral control

Abstract

Author supplied keywords

Cite

Register to see more suggestions