Research on reinforcement learning-based safe decision-making methodology for multiple unmanned aerial vehicles

Longfei Yue; Rennong Yang; Ying Zhang; Jialiang Zuo

Journal ArticleOPEN ACCESS

Research on reinforcement learning-based safe decision-making methodology for multiple unmanned aerial vehicles

Frontiers in Neurorobotics (2023) 16

DOI: 10.3389/fnbot.2022.1105480

9Citations

20Readers

Abstract

A system with multiple cooperating unmanned aerial vehicles (multi-UAVs) can use its advantages to accomplish complicated tasks. Recent developments in deep reinforcement learning (DRL) offer good prospects for decision-making for multi-UAV systems. However, the safety and training efficiencies of DRL still need to be improved before practical use. This study presents a transfer-safe soft actor-critic (TSSAC) for multi-UAV decision-making. Decision-making by each UAV is modeled with a constrained Markov decision process (CMDP), in which safety is constrained to maximize the return. The soft actor-critic-Lagrangian (SAC-Lagrangian) algorithm is combined with a modified Lagrangian multiplier in the CMDP model. Moreover, parameter-based transfer learning is used to enable cooperative and efficient training of the tasks to the multi-UAVs. Simulation experiments indicate that the proposed method can improve the safety and training efficiencies and allow the UAVs to adapt to a dynamic scenario.

Author supplied keywords

Cite

CITATION STYLE

APA

Yue, L., Yang, R., Zhang, Y., & Zuo, J. (2023). Research on reinforcement learning-based safe decision-making methodology for multiple unmanned aerial vehicles. Frontiers in Neurorobotics, 16. https://doi.org/10.3389/fnbot.2022.1105480

Research on reinforcement learning-based safe decision-making methodology for multiple unmanned aerial vehicles

Abstract

Author supplied keywords

Cite

Register to see more suggestions