This article presents a general class of associative reinforcement learning algorithms for connectionist networks containing stochastic units. These algorithms, called REINFORCE algorithms, are shown to make weight adjustments in a direction that lies along the...
CITATION STYLE
Williams, R. J. (1992). Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning. In Reinforcement Learning (pp. 5–32). Springer US. https://doi.org/10.1007/978-1-4615-3618-5_2
Mendeley helps you to discover research relevant for your work.