Deep Reinforcement Learning with Corrective Feedback for Autonomous UAV Landing on a Mobile Platform

Lizhen Wu; Chang Wang; Pengpeng Zhang; Changyun Wei

Journal ArticleOPEN ACCESS

Deep Reinforcement Learning with Corrective Feedback for Autonomous UAV Landing on a Mobile Platform

Drones (2022) 6(9)

DOI: 10.3390/drones6090238

30Citations

19Readers

Abstract

Autonomous Unmanned Aerial Vehicle (UAV) landing remains a challenge in uncertain environments, e.g., landing on a mobile ground platform such as an Unmanned Ground Vehicle (UGV) without knowing its motion dynamics. A traditional PID (Proportional, Integral, Derivative) controller is a choice for the UAV landing task, but it suffers the problem of manual parameter tuning, which becomes intractable if the initial landing condition changes or the mobile platform keeps moving. In this paper, we design a novel learning-based controller that integrates a standard PID module with a deep reinforcement learning module, which can automatically optimize the PID parameters for velocity control. In addition, corrective feedback based on heuristics of parameter tuning can speed up the learning process compared with traditional DRL algorithms that are typically time-consuming. In addition, the learned policy makes the UAV landing smooth and fast by allowing the UAV to adjust its speed adaptively according to the dynamics of the environment. We demonstrate the effectiveness of the proposed algorithm in a variety of quadrotor UAV landing tasks with both static and dynamic environmental settings.

Author supplied keywords

Cite

CITATION STYLE

APA

Wu, L., Wang, C., Zhang, P., & Wei, C. (2022). Deep Reinforcement Learning with Corrective Feedback for Autonomous UAV Landing on a Mobile Platform. Drones, 6(9). https://doi.org/10.3390/drones6090238

Deep Reinforcement Learning with Corrective Feedback for Autonomous UAV Landing on a Mobile Platform

Abstract

Author supplied keywords

Cite

Register to see more suggestions