A Reinforcement Learning Approach for Autonomous Control and Landing of a Quadrotor

Citations of this article
Mendeley users who have this article in their library.
Get full text


This paper looks into the problem of precise autonomous landing of an Unmanned Aerial Vehicle (UAV) which is considered to be a difficult problem as one has to generate appropriate landing trajectories in presence of dynamic constraints, such as, sudden changes in wind velocities and directions, downwash effects, change in payload etc. The problem is further compounded due to uncertainties arising from inaccurate model information and noisy sensor readings. The problem is partially solved by proposing a Reinforcement Learning (RL) based controller that uses Least Square Policy Iteration (LSPI) to learn the optimal control policies required for generating these trajectories. The efficacy of the approach is demonstrated through both simulation and real-world experiments with actual Parrot AR drone 2.0. According to our study, this is the first time such experimental results have been presented using RL based controller for drone landing, making it a novel contribution in this field.




Vankadari, M. B., Das, K., Shinde, C., & Kumar, S. (2018). A Reinforcement Learning Approach for Autonomous Control and Landing of a Quadrotor. In 2018 International Conference on Unmanned Aircraft Systems, ICUAS 2018 (pp. 676–683). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICUAS.2018.8453468

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free