Bandit-based search for constraint programming

Manuel Loth; Michèle Sebag; Youssef Hamadi; Marc Schoenauer

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 8124 LNCS, pp. 464-480 (2013)

DOI: 10.1007/978-3-642-40627-0_36

28 Citations

6 Readers

Abstract

Constraint Programming (CP) solvers classically explore the solution space using tree-search-based heuristics. Monte-Carlo Tree Search (MCTS), aimed at optimal sequential decision making under uncertainty, gradually grows a search tree to explore the most promising regions according to a specified reward function. At the crossroads of CP and MCTS, this paper presents the Bandit Search for Constraint Programming (BaSCoP) algorithm, which adapts MCTS to the specifics of CP search. The contribution relies on i) a generic reward function suited to CP and compatible with a multiple-restart strategy; ii) the use of depth-first search as the roll-out procedure in MCTS. BaSCoP, built on top of the Gecode constraint solver, is shown to significantly improve on depth-first search on some CP benchmark suites, demonstrating its relevance as a generic yet robust CP search method. © 2013 Springer-Verlag.
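
For readers unfamiliar with the bandit step that MCTS methods typically use to pick which branch to explore next, the following is a minimal sketch of the textbook UCB1 rule. It is not BaSCoP's selection rule or reward function, neither of which is specified in the abstract; the function name `ucb1_select`, its parameters, and the default exploration constant are assumptions made only for illustration.

```python
import math


def ucb1_select(counts, rewards, c=math.sqrt(2)):
    """Return the index of the branch to explore next under textbook UCB1.

    counts[i]  -- number of times branch i has been explored so far
    rewards[i] -- sum of rewards collected from branch i
    c          -- exploration constant (illustrative default, an assumption)
    """
    # Explore every branch once before relying on confidence bounds.
    for i, n in enumerate(counts):
        if n == 0:
            return i

    total = sum(counts)

    def score(i):
        mean = rewards[i] / counts[i]                       # exploitation term
        bonus = c * math.sqrt(math.log(total) / counts[i])  # exploration term
        return mean + bonus

    return max(range(len(counts)), key=score)
```

In a tree search, `counts` and `rewards` would hold the statistics of the children of the current node; a rarely visited branch receives a large exploration bonus, so it can be selected even when its average reward is lower.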

APA

Loth, M., Sebag, M., Hamadi, Y., & Schoenauer, M. (2013). Bandit-based search for constraint programming. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8124 LNCS, pp. 464–480). https://doi.org/10.1007/978-3-642-40627-0_36
