Training a model-free reinforcement learning controller for a 3-degree-of-freedom helicopter under multiple constraints

Shengri Xue; Zhan Li; Liu Yang

Journal ArticleOPEN ACCESS

Training a model-free reinforcement learning controller for a 3-degree-of-freedom helicopter under multiple constraints

Measurement and Control (United Kingdom) (2019) 52(7-8) 844-854

DOI: 10.1177/0020294019847711

11Citations

9Readers

Abstract

The purpose of the article is to design data-driven attitude controllers for a 3-degree-of-freedom experimental helicopter under multiple constraints. Controllers were updated by utilizing the reinforcement learning technique. The 3-degree-of-freedom helicopter platform is an approximation to a practical helicopter attitude control system, which includes realistic features such as complicated dynamics, coupling and uncertainties. The method in this paper first describes the training environment, which consists of user-defined constraints and performance expectations by using a reward function module. Then, actor–critic-based controllers were designed for helicopter elevation and pitch axis. Next, the policy gradient method, which is an important branch of the reinforcement learning algorithms, is utilized to train the networks and optimize controllers. Finally, from experimental results acquired by the 3-degree-of-freedom helicopter platform, the advantages of the proposed method are illustrated by satisfying multiple control constraints.

Author supplied keywords

Cite

CITATION STYLE

APA

Xue, S., Li, Z., & Yang, L. (2019). Training a model-free reinforcement learning controller for a 3-degree-of-freedom helicopter under multiple constraints. Measurement and Control (United Kingdom), 52(7–8), 844–854. https://doi.org/10.1177/0020294019847711

Training a model-free reinforcement learning controller for a 3-degree-of-freedom helicopter under multiple constraints

Abstract

Author supplied keywords

Cite

Register to see more suggestions