Coordinated rule acquisition of decision making on supply chain by exploitation-oriented reinforcement learning -beer game as an example-

4Citations
Citations of this article
15Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Product order decision-making is an important feature of inventory control in supply chains. The beer game represents a typical task in this process. Recent approaches that have applied the agent model to the beer game have shown. Q-learning performing better than genetic algorithm (GA). However, flexibly adapting to dynamic environment is difficult for these approaches because their learning algorithm assume a static environment. As exploitation-oriented reinforcement learning algorithm are robust in dynamic environments, this study, approaches the beer game using profit sharing, a typical exploitation-oriented agent learning algorithm, and verifies its result's validity by comparing performances. © 2013 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Saitoh, F., & Utani, A. (2013). Coordinated rule acquisition of decision making on supply chain by exploitation-oriented reinforcement learning -beer game as an example-. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8131 LNCS, pp. 537–544). https://doi.org/10.1007/978-3-642-40728-4_67

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free