To improve the efficiency of deep reinforcement learning (DRL)-based methods for robot manipulator trajectory planning in random working environments, we present three dense reward functions that differ from the traditional sparse reward. First, a posture reward function is proposed to speed up learning and produce more reasonable trajectories by modeling distance and direction constraints, which reduces blind exploration. Second, a stride reward function is proposed to improve the stability of learning by modeling constraints on the distance to the target and on the movement distance of the joints. Finally, to further improve learning efficiency, and inspired by the cognitive process of human behavior, we propose a stage incentive mechanism comprising a hard-stage incentive reward function and a soft-stage incentive reward function. Extensive experiments show that the soft-stage incentive reward function improves the convergence rate and achieves a higher mean reward and a lower standard deviation after convergence.
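As a rough Python sketch of how such dense shaping terms might be combined for a reaching task: this is not the paper's implementation; the function name, the weights w_dist, w_dir, and stage_scale, the exponential form of the soft-stage bonus, and the inputs are all assumptions made for illustration only.

import numpy as np

def dense_reward(ee_pos, ee_move_dir, target_pos, prev_dist,
                 w_dist=1.0, w_dir=0.5, stage_scale=2.0):
    # Hypothetical dense reward mixing distance, direction (posture),
    # stride (per-step progress), and a smooth stage incentive.
    dist = float(np.linalg.norm(target_pos - ee_pos))

    # Distance term: penalize the remaining distance to the target.
    r_dist = -w_dist * dist

    # Direction term: reward end-effector motion aligned with the target direction,
    # which discourages aimless (blind) exploration.
    to_target = (target_pos - ee_pos) / (dist + 1e-8)
    r_dir = w_dir * float(np.dot(ee_move_dir, to_target))

    # Stride term: reward the actual progress made this step, stabilizing learning
    # by discouraging large oscillating joint movements.
    r_stride = prev_dist - dist

    # Soft stage incentive: a smooth bonus that grows as the target is approached,
    # in contrast to a hard, threshold-based stage bonus.
    r_stage = stage_scale * np.exp(-dist)

    return r_dist + r_dir + r_stride + r_stage, dist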
CITATION STYLE
Peng, G., Yang, J., Li, X., & Khyam, M. O. (2023). Deep Reinforcement Learning with a Stage Incentive Mechanism of Dense Reward for Robotic Trajectory Planning. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 53(6), 3566–3573. https://doi.org/10.1109/TSMC.2022.3228901