B-APFDQN: A UAV Path Planning Algorithm Based on Deep Q-Network and Artificial Potential Field

20Citations
Citations of this article
19Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Deep Q-network (DQN) is one of the standard methods to solve the Unmanned Aerial Vehicle (UAV) path planning problem. However, the way agent deepens its cognition of the environment through frequent random trial-and-error leads to slow convergence. This paper proposes an optimized DQN with Artificial Potential Field (APF) as prior knowledge called B-APFDQN for path planning. Replacing the traditional neural network which has only one Q-value output with a multi-output neural network to promote the training process in combination with APF. Furthermore, a SA- ϵ-greedy algorithm that can automatically adjust the stochastic exploration frequency with steps and successes is proposed in order to prevent the agent from falling into local optimum. We remove the nodes that do not affect the path connectivity and apply the B-spline algorithm to make the path shorter and smoother. Simulation experiments show that the proposed B-APFDQN algorithm performs better than the classical DQN, has a strong ability to avoid falling into local optimum, and the obtained paths are smooth and shorter, which proves the effectiveness of B-APFDQN in the UAV path planning problem.

Cite

CITATION STYLE

APA

Kong, F., Wang, Q., Gao, S., & Yu, H. (2023). B-APFDQN: A UAV Path Planning Algorithm Based on Deep Q-Network and Artificial Potential Field. IEEE Access, 11, 44051–44064. https://doi.org/10.1109/ACCESS.2023.3273164

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free