MW-MADDPG: a meta-learning based decision-making method for collaborative UAV swarm

Minrui Zhao; Gang Wang; Qiang Fu; Xiangke Guo; Yu Chen; Tengda Li; Xiang Yu Liu

Journal ArticleOPEN ACCESS

MW-MADDPG: a meta-learning based decision-making method for collaborative UAV swarm

Frontiers in Neurorobotics (2023) 17

DOI: 10.3389/fnbot.2023.1243174

6Citations

8Readers

Abstract

Unmanned Aerial Vehicles (UAVs) have gained popularity due to their low lifecycle cost and minimal human risk, resulting in their widespread use in recent years. In the UAV swarm cooperative decision domain, multi-agent deep reinforcement learning has significant potential. However, current approaches are challenged by the multivariate mission environment and mission time constraints. In light of this, the present study proposes a meta-learning based multi-agent deep reinforcement learning approach that provides a viable solution to this problem. This paper presents an improved MAML-based multi-agent deep deterministic policy gradient (MADDPG) algorithm that achieves an unbiased initialization network by automatically assigning weights to meta-learning trajectories. In addition, a Reward-TD prioritized experience replay technique is introduced, which takes into account immediate reward and TD-error to improve the resilience and sample utilization of the algorithm. Experiment results show that the proposed approach effectively accomplishes the task in the new scenario, with significantly improved task success rate, average reward, and robustness compared to existing methods.

Author supplied keywords

Cite

CITATION STYLE

APA

Zhao, M., Wang, G., Fu, Q., Guo, X., Chen, Y., Li, T., & Liu, X. Y. (2023). MW-MADDPG: a meta-learning based decision-making method for collaborative UAV swarm. Frontiers in Neurorobotics, 17. https://doi.org/10.3389/fnbot.2023.1243174

MW-MADDPG: a meta-learning based decision-making method for collaborative UAV swarm

Abstract

Author supplied keywords

Cite

Register to see more suggestions