Split Q learning: Reinforcement learning with two-stream rewards

11Citations
Citations of this article
31Readers
Mendeley users who have this article in their library.

Abstract

Drawing an inspiration from behavioral studies of human decision making, we propose here a general parametric framework for a reinforcement learning problem, which extends the standard Q-learning approach to incorporate a two-stream framework of reward processing with biases biologically associated with several neurological and psychiatric conditions, including Parkinson's and Alzheimer's diseases, attention-deficit/hyperactivity disorder (ADHD), addiction, and chronic pain. For AI community, the development of agents that react differently to different types of rewards can enable us to understand a wide spectrum of multi-agent interactions in complex real-world socioeconomic systems. Moreover, from the behavioral modeling perspective, our parametric framework can be viewed as a first step towards a unifying computational model capturing reward processing abnormalities across multiple mental conditions and user preferences in long-term recommendation systems.

Cite

CITATION STYLE

APA

Lin, B., Bouneffouf, D., & Cecchi, G. (2019). Split Q learning: Reinforcement learning with two-stream rewards. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2019-August, pp. 6448–6449). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2019/913

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free