Abstract
To develop a supply chain management (SCM) sys-tem that performs optimally for both each entity in the chain and the entire chain, a multi-agent reinforcement learning (MARL) technique has been developed. To solve two problems of the MARL for SCM (build-ing a Markov decision processes for a supply chain and avoiding learning stagnation in a way similar to the " prisoner's dilemma "), a learning management method with deep-neural-network (DNN)-weight evolu-tion (LM-DWE) has been developed. By using a beer distribution game (BDG) as an example of a supply chain, experiments with a four-agent system were per-formed. Consequently, the LM-DWE successfully solved the above two problems and achieved 80.0% lower total cost than expert players of the BDG.
Cite
CITATION STYLE
Fuji, T., Ito, K., Matsumoto, K., & Yano, K. (2018). Deep Multi-Agent Reinforcement Learning using DNN-Weight Evolution to Optimize Supply Chain Performance. In Proceedings of the 51st Hawaii International Conference on System Sciences. Hawaii International Conference on System Sciences. https://doi.org/10.24251/hicss.2018.157
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.