Patching approximate solutions in reinforcement learning

Min Sub Kim; William Uther

Conference ProceedingsOPEN ACCESS

Patching approximate solutions in reinforcement learning

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4212 LNAI 258-269

DOI: 10.1007/11871842_27

0Citations

2Readers

Abstract

This paper introduces an approach to improving an approximate solution in reinforcement learning by augmenting it with a small overriding patch. Many approximate solutions are smaller and easier to produce than a flat solution, but the best solution within the constraints of the approximation may fall well short of global optimality. We present a technique for efficiently learning a small patch to reduce this gap. Empirical evaluation demonstrates the effectiveness of patching, producing combined solutions that are much closer to global optimality. © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Kim, M. S., & Uther, W. (2006). Patching approximate solutions in reinforcement learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4212 LNAI, pp. 258–269). Springer Verlag. https://doi.org/10.1007/11871842_27

Patching approximate solutions in reinforcement learning

Abstract

Cite

Register to see more suggestions