The temporal-difference (TD) algorithm from reinforcement learning provides a simple method for incrementally learning predictions of upcoming events. Applied to classical conditioning, TD models suppose that animals learn a real-time prediction of the unconditioned stimulus (US) on the basis of all available conditioned stimuli (CSs). In the TD model, as in other error-correction models, learning is driven by prediction errors: the difference between the change in the US prediction and the actual US. With the TD model, however, learning occurs continuously from moment to moment and is not artificially constrained to occur in trials. Accordingly, a key feature of any TD model is its assumption about how a CS is represented on a moment-to-moment basis. Here, we evaluate the performance of the TD model on a heretofore unexplored range of classical conditioning tasks. To do so, we consider three stimulus representations that vary in their degree of temporal generalization and evaluate how each representation influences the performance of the TD model on these conditioning tasks. © Psychonomic Society, Inc. 2012.
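The abstract does not spell out the model equations, but the moment-to-moment TD learning rule it describes can be sketched in a few lines. The code below is an illustrative Python sketch only: the parameter values (learning rate, discount, trace decay), the function and variable names, and the complete-serial-compound-style stimulus representation (one feature per CS per time step of presence) are all assumptions for the example, not details taken from the paper.

```python
import numpy as np

def td_conditioning(us, cs_steps, n_steps, n_trials,
                    alpha=0.05, gamma=0.97, lam=0.9):
    """Sketch of TD(lambda) learning of a real-time US prediction.

    us        : length-n_steps array of US magnitude at each time step.
    cs_steps  : dict mapping each CS name to the time steps it is present.
    The stimulus representation assumed here is CSC-like: every
    (CS, time step) pair gets its own binary feature.
    """
    features = [(cs, s) for cs, steps in cs_steps.items() for s in steps]
    n_features = len(features)

    def represent(t):
        # Binary feature vector active only for units tied to time step t.
        x = np.zeros(n_features)
        for i, (_, s) in enumerate(features):
            if s == t:
                x[i] = 1.0
        return x

    w = np.zeros(n_features)           # associative weights (US prediction)
    for _ in range(n_trials):
        e = np.zeros(n_features)       # eligibility traces, reset per trial
        x = represent(0)
        for t in range(n_steps - 1):
            x_next = represent(t + 1)
            # TD error: US delivered on the next step, plus the discounted
            # next prediction, minus the current prediction.
            delta = us[t + 1] + gamma * (w @ x_next) - (w @ x)
            e = gamma * lam * e + x    # accumulate eligibility
            w += alpha * delta * e     # error-correction weight update
            x = x_next
    return w

if __name__ == "__main__":
    # Hypothetical delay-conditioning example: one CS on steps 5-14,
    # US delivered at step 15.
    n_steps = 30
    us = np.zeros(n_steps)
    us[15] = 1.0
    cs_steps = {"CS_A": list(range(5, 15))}
    w = td_conditioning(us, cs_steps, n_steps, n_trials=200)
    print(np.round(w, 3))
```

Swapping out `represent` for a coarser or more temporally diffuse feature set is the kind of manipulation of temporal generalization that the abstract's comparison of stimulus representations refers to.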
Ludvig, E. A., Sutton, R. S., & Kehoe, E. J. (2012). Evaluating the TD model of classical conditioning. Learning & Behavior, 40(3), 305–319. https://doi.org/10.3758/s13420-012-0082-6