Learning dynamic algorithm portfolios

89Citations
Citations of this article
73Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Algorithm selection can be performed using a model of runtime distribution, learned during a preliminary training phase. There is a trade-off between the performance of model-based algorithm selection, and the cost of learning the model. In this paper, we treat this trade-off in the context of bandit problems. We propose a fully dynamic and online algorithm selection technique, with no separate training phase: all candidate algorithms are run in parallel, while a model incrementally learns their runtime distributions. A redundant set of time allocators uses the partially trained model to propose machine time shares for the algorithms. A bandit problem solver mixes the model-based shares with a uniform share, gradually increasing the impact of the best time allocators as the model improves. We present experiments with a set of SAT solvers on a mixed SAT-UNSAT benchmark; and with a set of solvers for the Auction Winner Determination problem. © Springer Science+Business Media, Inc. 2007.

Cite

CITATION STYLE

APA

Gagliolo, M., & Schmidhuber, J. (2006). Learning dynamic algorithm portfolios. In Annals of Mathematics and Artificial Intelligence (Vol. 47, pp. 295–328). https://doi.org/10.1007/s10472-006-9036-z

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free