Solving Dynamic Traveling Salesman Problems With Deep Reinforcement Learning

155Citations
Citations of this article
83Readers
Mendeley users who have this article in their library.
Get full text

Abstract

A traveling salesman problem (TSP) is a well-known NP-complete problem. Traditional TSP presumes that the locations of customers and the traveling time among customers are fixed and constant. In real-life cases, however, the traffic conditions and customer requests may change over time. To find the most economic route, the decisions can be made constantly upon the time-point when the salesman completes his service of each customer. This brings in a dynamic version of the traveling salesman problem (DTSP), which takes into account the information of real-time traffic and customer requests. DTSP can be extended to a dynamic pickup and delivery problem (DPDP). In this article, we ameliorate the attention model to make it possible to perceive environmental changes. A deep reinforcement learning algorithm is proposed to solve DTSP and DPDP instances with a size of up to 40 customers in 100 locations. Experiments show that our method can capture the dynamic changes and produce a highly satisfactory solution within a very short time. Compared with other baseline approaches, more than 5% improvements can be observed in many cases.

Cite

CITATION STYLE

APA

Zhang, Z., Liu, H., Zhou, M. C., & Wang, J. (2023). Solving Dynamic Traveling Salesman Problems With Deep Reinforcement Learning. IEEE Transactions on Neural Networks and Learning Systems, 34(4), 2119–2132. https://doi.org/10.1109/TNNLS.2021.3105905

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free