Learning dynamic algorithm portfolios

Matteo Gagliolo; Jürgen Schmidhuber

Conference Proceedings

Learning dynamic algorithm portfolios

Annals of Mathematics and Artificial Intelligence (2006) 47(3-4) 295-328

DOI: 10.1007/s10472-006-9036-z

89Citations

73Readers

Get full text

Abstract

Algorithm selection can be performed using a model of runtime distribution, learned during a preliminary training phase. There is a trade-off between the performance of model-based algorithm selection, and the cost of learning the model. In this paper, we treat this trade-off in the context of bandit problems. We propose a fully dynamic and online algorithm selection technique, with no separate training phase: all candidate algorithms are run in parallel, while a model incrementally learns their runtime distributions. A redundant set of time allocators uses the partially trained model to propose machine time shares for the algorithms. A bandit problem solver mixes the model-based shares with a uniform share, gradually increasing the impact of the best time allocators as the model improves. We present experiments with a set of SAT solvers on a mixed SAT-UNSAT benchmark; and with a set of solvers for the Auction Winner Determination problem. © Springer Science+Business Media, Inc. 2007.

Author supplied keywords

Cite

CITATION STYLE

APA

Gagliolo, M., & Schmidhuber, J. (2006). Learning dynamic algorithm portfolios. In Annals of Mathematics and Artificial Intelligence (Vol. 47, pp. 295–328). https://doi.org/10.1007/s10472-006-9036-z

Learning dynamic algorithm portfolios

Abstract

Author supplied keywords

Cite

Register to see more suggestions