Compound reinforcement learning

Tohgoroh Matsui

Journal ArticleOPEN ACCESS

Compound reinforcement learning

Matsui T

Transactions of the Japanese Society for Artificial Intelligence (2011) 26(2) 330-334

DOI: 10.1527/tjsai.26.330

0Citations

10Readers

Abstract

This paper describes a reinforcement learning framework based on compound returns, which is called compound reinforcement learning. Compound reinforcement learning maximizes the compound return in returns-based MDPs. We also describe compound Q-learning algorithm. We present experimental results using an ilustrative example, 2-armed bandit.

Author supplied keywords

Compound returns
Q-learning
Reinforecement learning
Value functions

Cite

CITATION STYLE

APA

Matsui, T. (2011). Compound reinforcement learning. Transactions of the Japanese Society for Artificial Intelligence, 26(2), 330–334. https://doi.org/10.1527/tjsai.26.330

Compound reinforcement learning

Abstract

Author supplied keywords

Cite

Register to see more suggestions