Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T

This article is free to access.

Abstract

The model-free algorithms of “reinforcement learning” (RL) have gained clout across disciplines, but so too have model-based alternatives. The present study emphasizes other dimensions of this model space in consideration of associative or discriminative generalization across states and actions. This “generalized reinforcement learning” (GRL) model, a frugal extension of RL, parsimoniously retains the single reward-prediction error (RPE), but the scope of learning goes beyond the experienced state and action. Instead, the generalized RPE is efficiently relayed for bidirectional counterfactual updating of value estimates for other representations. Aided by structural information but as an implicit rather than explicit cognitive map, GRL provided the most precise account of human behavior and individual differences in a reversal-learning task with hierarchical structure that encouraged inverse generalization across both states and actions. Reflecting inference that could be true, false (i.e., overgeneralization), or absent (i.e., undergeneralization), state generalization distinguished those who learned well more so than action generalization. With high-resolution high-field fMRI targeting the dopaminergic midbrain, the GRL model's RPE signals (alongside value and decision signals) were localized within not only the striatum but also the substantia nigra and the ventral tegmental area, including specific effects of generalization that also extend to the hippocampus. Factoring in generalization as a multidimensional process in value-based learning, these findings shed light on complexities that, while challenging classic RL, can still be resolved within the bounds of its core computations.
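The core idea in the abstract, a single reward-prediction error relayed to value estimates beyond the experienced state and action, can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the article: the two-state, two-action layout, the function name `grl_update`, the learning rate `alpha`, and the generalization weights `g_state` and `g_action` (positive for associative generalization, negative for discriminative or inverse generalization). The published model may be parameterized differently.

```python
import numpy as np


def grl_update(q, state, action, reward, alpha=0.3, g_state=-0.5, g_action=-0.5):
    """One hypothetical generalized-RL update on a 2x2 table of action values.

    q        : array of shape (2, 2), indexed as q[state, action]
    g_state  : assumed weight in [-1, 1] for relaying the RPE to the other state
    g_action : assumed weight in [-1, 1] for relaying the RPE to the other action
    Returns the updated copy of q and the reward-prediction error.
    """
    q = np.array(q, dtype=float)  # work on a copy
    other_state, other_action = 1 - state, 1 - action

    # Single reward-prediction error, computed only for the experienced pair.
    delta = reward - q[state, action]

    # Standard model-free update for the experienced state and action.
    q[state, action] += alpha * delta

    # Counterfactual updates: the same RPE is relayed to the other representations,
    # scaled by the generalization weights (negative weights push values apart).
    q[state, other_action] += alpha * g_action * delta
    q[other_state, action] += alpha * g_state * delta
    q[other_state, other_action] += alpha * g_state * g_action * delta

    return q, delta


q0 = np.zeros((2, 2))
q1, rpe = grl_update(q0, state=0, action=0, reward=1.0,
                     alpha=0.5, g_state=-1.0, g_action=-0.5)
print(rpe)
print(q1)
```

With negative weights, a reward for one action in one state lowers the value of the alternative action and of the same action in the other state, which is the kind of inverse generalization a reversal task with mirrored structure would encourage. Setting both weights to zero recovers the ordinary model-free update in this sketch.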

Citation (APA)

Colas, J. T., Dundon, N. M., Gerraty, R. T., Saragosa-Harris, N. M., Szymula, K. P., Tanwisuth, K., … O’Doherty, J. P. (2022). Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T. Human Brain Mapping, 43(15), 4750–4790. https://doi.org/10.1002/hbm.25988
