Reinforcement learning for the traveling salesman problem with refueling

André L.C. Ottoni; Erivelton G. Nepomuceno; Marcos S.de Oliveira; Daniela C.R.de Oliveira

Journal ArticleOPEN ACCESS

Reinforcement learning for the traveling salesman problem with refueling

Complex and Intelligent Systems (2022) 8(3) 2001-2015

DOI: 10.1007/s40747-021-00444-4

47Citations

63Readers

Abstract

The traveling salesman problem (TSP) is one of the best-known combinatorial optimization problems. Many methods derived from TSP have been applied to study autonomous vehicle route planning with fuel constraints. Nevertheless, less attention has been paid to reinforcement learning (RL) as a potential method to solve refueling problems. This paper employs RL to solve the traveling salesman problem With refueling (TSPWR). The technique proposes a model (actions, states, reinforcements) and RL-TSPWR algorithm. Focus is given on the analysis of RL parameters and on the refueling influence in route learning optimization of fuel cost. Two RL algorithms: Q-learning and SARSA are compared. In addition, RL parameter estimation is performed by Response Surface Methodology, Analysis of Variance and Tukey Test. The proposed method achieves the best solution in 15 out of 16 case studies.

Author supplied keywords

Cite

CITATION STYLE

APA

Ottoni, A. L. C., Nepomuceno, E. G., Oliveira, M. S. de, & Oliveira, D. C. R. de. (2022). Reinforcement learning for the traveling salesman problem with refueling. Complex and Intelligent Systems, 8(3), 2001–2015. https://doi.org/10.1007/s40747-021-00444-4

Reinforcement learning for the traveling salesman problem with refueling

Abstract

Author supplied keywords

Cite

Register to see more suggestions