Imitation reinforcement learning-based remote rotary inverted pendulum control in openflow network

33Citations
Citations of this article
45Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Rotary inverted pendulum is an unstable and highly nonlinear device and has been used as a common application model in nonlinear control engineering field. In this paper, we use a rotary inverted pendulum as a deep reinforcement learning environment. The real device is composed of a cyber environment and physical environment based on the OpenFlow network, and the MQTT protocol is used on the Ethernet connection to connect the cyber environment and the physical environment. The reinforcement learning agent is learned to control the real device located remotely from the controller, and the classical PID controller is also utilized to implement the imitation reinforcement learning and facilitate the learning process. From our CPS-based experimental system, we verify that a deep reinforcement learning agent can successfully control the real device located remotely from the agent, and our imitation learning strategy can make the learning time reduced effectively.

Cite

CITATION STYLE

APA

Kim, J. B., Lim, H. K., Kim, C. M., Kim, M. S., Hong, Y. G., & Han, Y. H. (2019). Imitation reinforcement learning-based remote rotary inverted pendulum control in openflow network. IEEE Access, 7, 36682–36690. https://doi.org/10.1109/ACCESS.2019.2905621

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free