Compound reinforcement learning

Abstract

This paper describes a reinforcement learning framework based on compound returns, called compound reinforcement learning. Compound reinforcement learning maximizes the compound return in returns-based MDPs. We also describe the compound Q-learning algorithm. We present experimental results using an illustrative example, a two-armed bandit.
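As an illustration of why maximizing a compound return can differ from maximizing an expected per-step return, here is a minimal sketch. It assumes the standard financial definition of compound return, the product of (1 + r) over per-step rates of return r, which the abstract does not spell out. The two arms, their payoffs, and all function names are hypothetical and are not taken from the paper's experiments. This is not the paper's compound Q-learning algorithm.

```python
import math

def compound_return(rates):
    """Product of (1 + r) over a sequence of per-step rates of return."""
    total = 1.0
    for r in rates:
        total *= 1.0 + r
    return total

def expected_rate(outcomes):
    """Expected per-step rate for a list of (probability, rate) pairs."""
    return sum(p * r for p, r in outcomes)

def expected_log_growth(outcomes):
    """Expected log(1 + r); this governs long-run compound growth."""
    return sum(p * math.log(1.0 + r) for p, r in outcomes)

# Hypothetical two-armed bandit: each arm is a list of (probability, rate).
ARM_SAFE = [(1.0, 0.05)]                # always +5%
ARM_RISKY = [(0.5, 1.0), (0.5, -0.8)]   # +100% or -80%, equally likely

if __name__ == "__main__":
    for name, arm in (("safe", ARM_SAFE), ("risky", ARM_RISKY)):
        print(name, expected_rate(arm), expected_log_growth(arm))
```

Under these assumed payoffs the risky arm has the higher expected per-step rate but a negative expected log growth, so a learner that maximizes compound return would prefer the safe arm.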

Cite

Matsui, T. (2011). Compound reinforcement learning. Transactions of the Japanese Society for Artificial Intelligence, 26(2), 330–334. https://doi.org/10.1527/tjsai.26.330
