MARL-Based Cooperative Multi-AGV Control in Warehouse Systems

9Citations
Citations of this article
35Readers
Mendeley users who have this article in their library.

Abstract

Automated guided vehicles (AGVs) are essential components for the automation of fulfillment centers, a type of warehouse, where goods are stored on shelves and carried by AGVs. To increase the productivity in inventory management, a well-organized cooperative path control is required to transport goods to the designated picking stations. In this paper, we propose a QMIX-based scheme for the cooperative path control of multiple AGVs. Although QMIX is the one of popular cooperative multi-agent reinforcement learning algorithms, we find that QMIX alone was not enough to increase productivity in warehouse systems. So, we develop two novel techniques that can be used with QMIX: 1) sequential action masking that eliminates all the collision cases and 2) additional local loss that improves collaboration of individual AGVs. They help to encourage the AGVs to cooperate more for high productivity. By extensive simulations, we present the superiority of the proposed scheme on several layouts in fulfillment centers. The effect of cooperation among AGVs in the proposed scheme is verified through the comparison study with the existing algorithms. Additionally, we show the generalization performance by investigating the reusability of the model trained with the proposed scheme.

Cite

CITATION STYLE

APA

Choi, H. B., Kim, J. B., Han, Y. H., Oh, S. W., & Kim, K. (2022). MARL-Based Cooperative Multi-AGV Control in Warehouse Systems. IEEE Access, 10, 100478–100488. https://doi.org/10.1109/ACCESS.2022.3206537

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free