Abstract
In this paper, we present implementations of an annealing-based and a gate-based quantum computing approach for finding the optimal policy to traverse a grid and compare them to a classical deep reinforcement learning approach. We extended these three approaches by allowing for stochastic actions instead of deterministic actions and by introducing a new learning technique called curriculum learning. With curriculum learning, we gradually increase the complexity of the environment and we find that it has a positive effect on the expected reward of a traversal. We see that the number of training steps needed for the two quantum approaches is lower than that needed for the classical approach.
Author supplied keywords
Cite
CITATION STYLE
Neumann, N. M. P., de Heer, P. B. U. L., & Phillipson, F. (2023). Quantum reinforcement learning: Comparing quantum annealing and gate-based quantum computing with classical deep reinforcement learning. Quantum Information Processing, 22(2). https://doi.org/10.1007/s11128-023-03867-9
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.