The Upper Confidence Bounds (UCB) algorithm is a well-known near-optimal strategy for the stochastic multi-armed bandit problem. Its extensions to trees, such as the Upper Confidence Tree (UCT) algorithm, have resulted in good solutions to the problem of Go. This paper introduces DUCT, a distributed algorithm inspired by UCT, for solving Distributed Constraint Optimization Problems (DCOP). Bounds on the solution quality are provided, and experiments show that, compared to existing DCOP approaches, DUCT is able to solve very large problems much more efficiently, or to find significantly higher quality solutions.
CITATION STYLE
Ottens, B., Dimitrakakis, C., & Faltings, B. (2012). DUCT: An Upper Confidence Bound Approach to Distributed Constraint Optimization Problems. In Proceedings of the 26th AAAI Conference on Artificial Intelligence, AAAI 2012 (pp. 528–534). AAAI Press. https://doi.org/10.1609/aaai.v26i1.8129
Mendeley helps you to discover research relevant for your work.