Combining multiple correlated reward and shaping signals by measuring confidence

Tim Brys; Ann Nowé; Daniel Kudenko; Matthew E. Taylor

Conference ProceedingsOPEN ACCESS

Combining multiple correlated reward and shaping signals by measuring confidence

Belgian/Netherlands Artificial Intelligence Conference (2014) 139-140

DOI: 10.1609/aaai.v28i1.8998

2Citations

25Readers

Abstract

This paper falls in a line of research that investigates how reinforcement learning algorithms can be sped up by combining two AI techniques: multi-objectivization and ensemble techniques. Multi-objectivization is a process that turns a single-objective problem into a multi-objective one, essentially creating a diverse set of feedback signals for the original single-objective task. These signals can be strategically combined using ensemble techniques to speed up and improve learning. In this paper, we present a novel ensemble technique that detects where and when each of the signals is most informative, and proceeds to use that one for action selection.

Cite

CITATION STYLE

APA

Brys, T., Nowé, A., Kudenko, D., & Taylor, M. E. (2014). Combining multiple correlated reward and shaping signals by measuring confidence. In Belgian/Netherlands Artificial Intelligence Conference (pp. 139–140). University of Groningen. https://doi.org/10.1609/aaai.v28i1.8998

Combining multiple correlated reward and shaping signals by measuring confidence

Abstract

Cite

Register to see more suggestions