Combining multiple correlated reward and shaping signals by measuring confidence

2Citations
Citations of this article
25Readers
Mendeley users who have this article in their library.

Abstract

This paper falls in a line of research that investigates how reinforcement learning algorithms can be sped up by combining two AI techniques: multi-objectivization and ensemble techniques. Multi-objectivization is a process that turns a single-objective problem into a multi-objective one, essentially creating a diverse set of feedback signals for the original single-objective task. These signals can be strategically combined using ensemble techniques to speed up and improve learning. In this paper, we present a novel ensemble technique that detects where and when each of the signals is most informative, and proceeds to use that one for action selection.

Cite

CITATION STYLE

APA

Brys, T., Nowé, A., Kudenko, D., & Taylor, M. E. (2014). Combining multiple correlated reward and shaping signals by measuring confidence. In Belgian/Netherlands Artificial Intelligence Conference (pp. 139–140). University of Groningen. https://doi.org/10.1609/aaai.v28i1.8998

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free