Monte-carlo UCT Search

  • Fürnkranz J
  • Scheffer T
  • Spiliopoulou M
  • et al.
N/ACitations
Citations of this article
6Readers
Mendeley users who have this article in their library.

Abstract

For large state-space Markovian Decision Problems Monte-Carlo planning is one of the few viable approaches to find near-optimal solutions. In this paper we introduce a new algorithm, UCT, that applies bandit ideas to guide Monte-Carlo planning. In finite-horizon or discounted MDPs the algorithm is shown to be consistent and finite sample bounds are derived on the estimation error due to sampling. Experimental results show that in several domains, UCT is significantly more efficient than its alternatives.

Cite

CITATION STYLE

APA

Fürnkranz, J., Scheffer, T., Spiliopoulou, M., Kocsis, L., & Szepesvári, C. (2006). Monte-carlo UCT Search. Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence, 4212(June), 282-293–293. Retrieved from http://www.springerlink.com/content/d232253353517276/%5Cnhttp://senseis.xmp.net/?UCT

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free