Application of learning automata for stochastic online scheduling

1Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We look at a stochastic online scheduling problem where exact joblenghts are unknown and jobs arrive over time. Heuristics exist which perform very well, but do not extend to multi-stage problems where all jobs must be processed by a sequence of machines. We apply Learning Automata (LA), a Reinforcement Learning technique, successfully to such a multi-stage scheduling setting. We use a Learning Automaton at each decision point in the production chain. Each Learning Automaton has a probability distribution over the machines it can chose. The difference with simple randomization algorithms is the update rule used by the LA. Whenever a job is finished, the LA are notified and update their probability distribution: if the job was finished faster than expected the probability for selecting the same action is increased, otherwise it is decreased. Due to this adaptation, LA can learn processing capacities of the machines, or more correctly: the entire downstream production chain. © 2010 Springer -Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Martinez, Y., Van Vreckem, B., Catteeuw, D., & Nowe, A. (2010). Application of learning automata for stochastic online scheduling. In Recent Advances in Optimization and its Applications in Engineering (pp. 491–498). Springer Berlin Heidelberg. https://doi.org/10.1007/978-3-642-12598-0_43

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free