Coordinated rule acquisition of decision making on supply chain by exploitation-oriented reinforcement learning -beer game as an example-

Fumiaki Saitoh; Akihide Utani

Conference Proceedings

Coordinated rule acquisition of decision making on supply chain by exploitation-oriented reinforcement learning -beer game as an example-

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2013) 8131 LNCS 537-544

DOI: 10.1007/978-3-642-40728-4_67

4Citations

15Readers

Get full text

Abstract

Product order decision-making is an important feature of inventory control in supply chains. The beer game represents a typical task in this process. Recent approaches that have applied the agent model to the beer game have shown. Q-learning performing better than genetic algorithm (GA). However, flexibly adapting to dynamic environment is difficult for these approaches because their learning algorithm assume a static environment. As exploitation-oriented reinforcement learning algorithm are robust in dynamic environments, this study, approaches the beer game using profit sharing, a typical exploitation-oriented agent learning algorithm, and verifies its result's validity by comparing performances. © 2013 Springer-Verlag Berlin Heidelberg.

Author supplied keywords

Cite

CITATION STYLE

APA

Saitoh, F., & Utani, A. (2013). Coordinated rule acquisition of decision making on supply chain by exploitation-oriented reinforcement learning -beer game as an example-. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8131 LNCS, pp. 537–544). https://doi.org/10.1007/978-3-642-40728-4_67

Coordinated rule acquisition of decision making on supply chain by exploitation-oriented reinforcement learning -beer game as an example-

Abstract

Author supplied keywords

Cite

Register to see more suggestions