Modelling coordination of learning systems: A reservoir systems approach to dopamine modulated Pavlovian conditioning

Abstract

This paper presents a biologically constrained reward prediction model that learns cue-outcome associations involving temporally distant stimuli without relying on the widely used temporal difference model. The model makes novel use of an adapted echo state network in place of the biologically implausible delay chains usually employed, in models of dopamine phenomena, to handle temporally structured stimuli. Moreover, the model is based on a novel algorithm that coordinates two subsystems: one providing Pavlovian conditioning, the other providing timely inhibition of dopamine responses to salient anticipated stimuli. The model is validated against the typical profile of phasic dopamine in first- and second-order Pavlovian conditioning. It is relevant not only to explaining the mechanisms underlying the biological regulation of dopamine signals, but also to applications in autonomous robotics involving reinforcement-based learning. © 2011 Springer-Verlag.
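To make the role of the echo state network more concrete, the following is a minimal sketch of a generic leaky echo state reservoir driven by a brief cue. It is not the authors' adapted network: the unit count, leak rate, weight scaling, and the function names `make_reservoir`, `step`, and `cue_trace` are all illustrative assumptions. It only shows the general property that makes a reservoir a candidate substitute for a delay chain: after the cue ends, the recurrent state keeps a fading trace of it that a downstream readout could, in principle, use to time a prediction.

```python
import math
import random


def make_reservoir(n_units, seed=0, row_norm=0.9):
    """Build random input and recurrent weights (illustrative values only).

    The recurrent matrix is rescaled so that its largest absolute row sum
    equals `row_norm` (< 1). With tanh units this is a simple sufficient
    condition for fading memory; it is an assumption of this sketch, not a
    detail taken from the paper.
    """
    rng = random.Random(seed)
    w_in = [rng.uniform(-1.0, 1.0) for _ in range(n_units)]
    w = [[rng.uniform(-1.0, 1.0) for _ in range(n_units)]
         for _ in range(n_units)]
    max_row_sum = max(sum(abs(v) for v in row) for row in w)
    scale = row_norm / max_row_sum
    w = [[v * scale for v in row] for row in w]
    return w_in, w


def step(state, u, w_in, w, leak=0.3):
    """One leaky update: x <- (1 - leak) * x + leak * tanh(W x + w_in * u)."""
    new_state = []
    for i, row in enumerate(w):
        pre = w_in[i] * u + sum(wij * xj for wij, xj in zip(row, state))
        new_state.append((1.0 - leak) * state[i] + leak * math.tanh(pre))
    return new_state


def cue_trace(n_units=20, n_steps=30, cue_steps=(0,), seed=0):
    """Drive the reservoir with a brief cue; return max |x_i| at each step."""
    w_in, w = make_reservoir(n_units, seed)
    state = [0.0] * n_units
    trace = []
    for t in range(n_steps):
        u = 1.0 if t in cue_steps else 0.0  # cue is on only at cue_steps
        state = step(state, u, w_in, w)
        trace.append(max(abs(x) for x in state))
    return trace
```

Calling `cue_trace()` returns a sequence that is largest at the cue step and shrinks gradually afterwards while staying nonzero for many steps. A delay chain would represent the same elapsed time with one dedicated unit per time step, which is the feature the abstract describes as biologically implausible.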

Cite

Lowe, R., Mannella, F., Ziemke, T., & Baldassarre, G. (2011). Modelling coordination of learning systems: A reservoir systems approach to dopamine modulated Pavlovian conditioning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5777 LNAI, pp. 410–417). https://doi.org/10.1007/978-3-642-21283-3_51
