Feasibility of dopamine as a vector-valued feedback signal in the basal ganglia

4Citations
Citations of this article
16Readers
Mendeley users who have this article in their library.
Get full text

Abstract

It is well established that midbrain dopaminergic neurons support reinforcement learning (RL) in the basal ganglia by transmitting a reward prediction error (RPE) to the striatum. In particular, different computational models and experiments have shown that a striatum-wide RPE signal can support RL over a small discrete set of actions (e.g., no/no-go, choose left/right). However, there is accumulating evidence that the basal ganglia functions not as a selector between predefined actions but rather as a dynamical system with graded, continuous outputs. To reconcile this view with RL, there is a need to explain how dopamine could support learning of continuous outputs, rather than discrete action values. Inspired by the recent observations that besides RPE, the firing rates of midbrain dopaminergic neurons correlate with motor and cognitive variables, we propose a model in which dopamine signal in the striatum carries a vector-valued error feedback signal (a loss gradient) instead of a homogeneous scalar error (a loss). We implement a local, “three-factor” corticostriatal plasticity rule involving the presynaptic firing rate, a postsynaptic factor, and the unique dopamine concentration perceived by each striatal neuron. With this learning rule, we show that such a vector-valued feedback signal results in an increased capacity to learn a multidimensional series of real-valued outputs. Crucially, we demonstrate that this plasticity rule does not require precise nigrostriatal synapses but remains compatible with experimental observations of random placement of varicosities and diffuse volume transmission of dopamine.

References Powered by Scopus

A neural substrate of prediction and reward

6665Citations
N/AReaders
Get full text

Julia: A fresh approach to numerical computing

3819Citations
N/AReaders
Get full text

Harnessing Nonlinearity: Predicting Chaotic Systems and Saving Energy in Wireless Communication

2920Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Reward Bases: A simple mechanism for adaptive acquisition of multiple reward types

3Citations
N/AReaders
Get full text

Predictive Representations: Building Blocks of Intelligence

1Citations
N/AReaders
Get full text

Chaotic recurrent neural networks for brain modelling: A review

0Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Wärnberg, E., & Kumar, A. (2023). Feasibility of dopamine as a vector-valued feedback signal in the basal ganglia. Proceedings of the National Academy of Sciences of the United States of America, 120(32). https://doi.org/10.1073/pnas.2221994120

Readers over time

‘23‘24‘2502468

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 5

50%

Researcher 4

40%

Professor / Associate Prof. 1

10%

Readers' Discipline

Tooltip

Neuroscience 4

50%

Computer Science 2

25%

Social Sciences 1

13%

Engineering 1

13%

Article Metrics

Tooltip
Mentions
News Mentions: 1

Save time finding and organizing research with Mendeley

Sign up for free
0