Deep Multi-Agent Reinforcement Learning using DNN-Weight Evolution to Optimize Supply Chain Performance

  • Fuji T
  • Ito K
  • Matsumoto K
  • et al.
N/ACitations
Citations of this article
46Readers
Mendeley users who have this article in their library.

Abstract

To develop a supply chain management (SCM) sys-tem that performs optimally for both each entity in the chain and the entire chain, a multi-agent reinforcement learning (MARL) technique has been developed. To solve two problems of the MARL for SCM (build-ing a Markov decision processes for a supply chain and avoiding learning stagnation in a way similar to the " prisoner's dilemma "), a learning management method with deep-neural-network (DNN)-weight evolu-tion (LM-DWE) has been developed. By using a beer distribution game (BDG) as an example of a supply chain, experiments with a four-agent system were per-formed. Consequently, the LM-DWE successfully solved the above two problems and achieved 80.0% lower total cost than expert players of the BDG.

Cite

CITATION STYLE

APA

Fuji, T., Ito, K., Matsumoto, K., & Yano, K. (2018). Deep Multi-Agent Reinforcement Learning using DNN-Weight Evolution to Optimize Supply Chain Performance. In Proceedings of the 51st Hawaii International Conference on System Sciences. Hawaii International Conference on System Sciences. https://doi.org/10.24251/hicss.2018.157

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free