Split Q learning: Reinforcement learning with two-stream rewards

Baihan Lin; Djallel Bouneffouf; Guillermo Cecchi

Conference ProceedingsOPEN ACCESS

Split Q learning: Reinforcement learning with two-stream rewards

IJCAI International Joint Conference on Artificial Intelligence (2019) 2019-August 6448-6449

DOI: 10.24963/ijcai.2019/913

11Citations

31Readers

Abstract

Drawing an inspiration from behavioral studies of human decision making, we propose here a general parametric framework for a reinforcement learning problem, which extends the standard Q-learning approach to incorporate a two-stream framework of reward processing with biases biologically associated with several neurological and psychiatric conditions, including Parkinson's and Alzheimer's diseases, attention-deficit/hyperactivity disorder (ADHD), addiction, and chronic pain. For AI community, the development of agents that react differently to different types of rewards can enable us to understand a wide spectrum of multi-agent interactions in complex real-world socioeconomic systems. Moreover, from the behavioral modeling perspective, our parametric framework can be viewed as a first step towards a unifying computational model capturing reward processing abnormalities across multiple mental conditions and user preferences in long-term recommendation systems.

Cite

CITATION STYLE

APA

Lin, B., Bouneffouf, D., & Cecchi, G. (2019). Split Q learning: Reinforcement learning with two-stream rewards. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2019-August, pp. 6448–6449). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2019/913

Split Q learning: Reinforcement learning with two-stream rewards

Abstract

Cite

Register to see more suggestions