What is happiness for reinforcement learning agents? We seek a formal definition satisfying a list of desiderata. Our proposed definition of happiness is the temporal difference error, i.e. the difference between the value of the obtained reward and observation, and the agent's expectation of that value. This definition satisfies most of our desiderata and is compatible with empirical research on humans. We state several implications and discuss examples.
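The sketch below illustrates the kind of quantity the abstract refers to: the standard temporal difference (TD) error for a state-value function. It is an assumption-laden illustration, not the paper's formulation; the paper defines the error over obtained rewards and observations (histories), whereas this sketch uses a tabular state-value function `value`, a discount factor `gamma`, and the usual one-step TD error.

```python
# Minimal sketch: "happiness" read as the temporal difference (TD) error.
# Assumes a tabular state-value setting with discount factor gamma; the
# names (value, state, next_state, gamma) are illustrative assumptions,
# not taken from the paper.

def happiness(value, state, reward, next_state, gamma=0.9):
    """TD error: obtained reward plus discounted value of the next state,
    minus the value the agent expected from the current state."""
    return reward + gamma * value[next_state] - value[state]

# Example: the agent expected nothing from state "A" but received a large
# reward and reached a valuable state, so the TD error is positive.
value = {"A": 0.0, "B": 1.0}
print(happiness(value, state="A", reward=2.0, next_state="B"))  # 2.9
```

A positive TD error corresponds to outcomes better than expected, a negative one to outcomes worse than expected, which is the intuition the abstract's definition captures.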
Daswani, M., & Leike, J. (2015). A definition of happiness for reinforcement learning agents. In Lecture Notes in Computer Science (Vol. 9205, pp. 231–240). Springer. https://doi.org/10.1007/978-3-319-21365-1_24