A new distributed reinforcement learning algorithm for multiple objective optimization problems

Carlos Mariano; Eduardo Morales

Conference Proceedings

A new distributed reinforcement learning algorithm for multiple objective optimization problems

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2000) 1952 LNAI 290-299

DOI: 10.1007/3-540-44399-1_30

16Citations

30Readers

Get full text

Abstract

This paper describes a new algorithm, called MDQL, for the solution of multiple objective optimization problems. MDQL is based on a new distributed Q-learning algorithm, called DQL, which is also introduced in this paper. In DQL a family of independent agents, exploring different options, finds a common policy in a common environment. Information about action goodness is transmitted using traces over state-action pairs. MDQL extends this idea to multiple objectives, assigning a family of agents for each objective involved. A non-dominant criterion is used to construct Pareto fronts and by delaying adjustments on the rewards MDQL achieves better distributions of solutions. Furthermore, an extension for applying reinforcement learning to continuous functions is also given. Successful results of MDQL on several test-bed problems suggested in the literature are described. © Springer-Verlag 2000.

Cite

CITATION STYLE

APA

Mariano, C., & Morales, E. (2000). A new distributed reinforcement learning algorithm for multiple objective optimization problems. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1952 LNAI, pp. 290–299). Springer Verlag. https://doi.org/10.1007/3-540-44399-1_30

A new distributed reinforcement learning algorithm for multiple objective optimization problems

Abstract

Cite

Register to see more suggestions