In this paper we derive a simple reinforcement learning rule based on a more general form of REINFORCE formulation. We test our new rule on both classification and reinforcement problems. The results have shown that although this simple learning rule has a high probability of being stuck in local optimum for the case of classification tasks, it is able to solve some global reinforcement problems (e.g. the cart-pole balancing problem) directly in the continuous space. © Springer-Verlag Berlin Heidelberg 2007.
CITATION STYLE
Ma, X. (2007). An extremely simple reinforcement learning rule for neural networks. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4491 LNCS, pp. 434–440). Springer Verlag. https://doi.org/10.1007/978-3-540-72383-7_51
Mendeley helps you to discover research relevant for your work.