This paper introduces an approach to improving an approximate solution in reinforcement learning by augmenting it with a small overriding patch. Many approximate solutions are smaller and easier to produce than a flat solution, but the best solution within the constraints of the approximation may fall well short of global optimality. We present a technique for efficiently learning a small patch to reduce this gap. Empirical evaluation demonstrates the effectiveness of patching, producing combined solutions that are much closer to global optimality. © Springer-Verlag Berlin Heidelberg 2006.
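The override idea can be illustrated with a minimal sketch. It assumes the approximate solution is available as a policy function and that the patch is a small table of state-to-action overrides. The abstract does not specify how the patch is represented or learned, so the names `make_patched_policy`, `base_policy`, and `patch`, and the toy states and actions, are hypothetical and are not the authors' method.

```python
from typing import Callable, Dict, Hashable

State = Hashable
Action = Hashable


def make_patched_policy(
    base_policy: Callable[[State], Action],
    patch: Dict[State, Action],
) -> Callable[[State], Action]:
    """Combine an approximate policy with a small overriding patch.

    Assumption (not from the paper): the patch is a lookup table that
    takes precedence over the base policy on the states it covers.
    """

    def policy(state: State) -> Action:
        # The patch overrides the approximate solution where it is defined.
        if state in patch:
            return patch[state]
        # Everywhere else, fall back to the approximate solution.
        return base_policy(state)

    return policy


# Hypothetical toy example: the approximate policy always moves right,
# and the patch corrects two states where that choice is assumed to be poor.
def base_policy(state: State) -> Action:
    return "right"


patch = {3: "up", 4: "up"}
patched_policy = make_patched_policy(base_policy, patch)
actions = [patched_policy(s) for s in range(6)]
```

The sketch covers only how a combined solution would be queried once a patch exists. How the patch is learned efficiently is the paper's contribution and is not reproduced here.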
CITATION STYLE
Kim, M. S., & Uther, W. (2006). Patching approximate solutions in reinforcement learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4212 LNAI, pp. 258–269). Springer Verlag. https://doi.org/10.1007/11871842_27