Selection and reinforcement learning for combinatorial optimization

A. Berny

Conference Proceedings

Selection and reinforcement learning for combinatorial optimization

Berny A

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2000) 1917 601-610

DOI: 10.1007/3-540-45356-3_59

21Citations

17Readers

Get full text

Abstract

Improving on a previous paper, we explicitly relate reinforcement and selection learning (PBIL) algorithms for combinatorial optimization, which is understood as the task of finding a fixed-length binary string maximizing an arbitrary function. We show the equivalence of searching for an optimal string and searching for a probability distribution over strings maximizing the function expectation. In this paper however, we will only consider the family of Bernoulli distributions. Next, we introduce two gradient dynamical systems acting on probability vectors. The first one maximizes the expectation of the function and leads to reinforcement learning algorithms whereas the second one maximizes the logarithm of the expectation of the function and leads to selection learning algorithms. We finally give a stability analysis of solutions.

Cite

CITATION STYLE

APA

Berny, A. (2000). Selection and reinforcement learning for combinatorial optimization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1917, pp. 601–610). Springer Verlag. https://doi.org/10.1007/3-540-45356-3_59

Selection and reinforcement learning for combinatorial optimization

Abstract

Cite

Register to see more suggestions