Distributed Reinforcement Learning for Optimizing Resource Allocation in Autonomous Logistics Processes

Jan Ole Berndt; Otthein Herzog

Book Chapter

Springer Science and Business Media B.V. (2013), pp. 429–439

DOI: 10.1007/978-3-642-35966-8_36

Abstract

This paper examines multi-agent coordination for resource allocation tasks in autonomous logistics processes. It identifies requirements for the learning of optimal behavior in a multi-agent setting. Based on a real-world logistics application, the paper distinguishes between single resource allocation by independent agents and joint activities by agent teams. For both cases it introduces adaptations of the Q-learning algorithm and evaluates their convergence as well as their scalability for large scenarios. The results demonstrate that the known conditions for the convergence of multi-agent reinforcement learning are insufficient. This leads to the identification of an additional requirement for convergence in this paper.

Citation (APA)

Berndt, J. O., & Herzog, O. (2013). Distributed Reinforcement Learning for Optimizing Resource Allocation in Autonomous Logistics Processes. In Lecture Notes in Logistics (pp. 429–439). Springer Science and Business Media B.V. https://doi.org/10.1007/978-3-642-35966-8_36
