We consider a biologically plausible model of the basal ganglia that is able to learn a probabilistic two armed bandit task using reinforcement learning. This model is able to choose the best option and to reach optimal performances after only a few trials. However, we show in this study that the influence of exogenous factors such as stimuli salience and/or timing seems to prevail over optimal decision making, hence questioning the very definition of action-selection. What are the ecological conditions for optimal action selection?.
CITATION STYLE
Nallapu, B. T., & Rougier, N. P. (2016). Dynamics of reward based decision making: A computational study. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9886 LNCS, pp. 322–329). Springer Verlag. https://doi.org/10.1007/978-3-319-44778-0_38
Mendeley helps you to discover research relevant for your work.