Death and suicide in universal artificial intelligence

Jarryd Martin; Tom Everitt; Marcus Hutter

Conference Proceedings

Death and suicide in universal artificial intelligence

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9782 23-32

DOI: 10.1007/978-3-319-41649-6_3

5Citations

61Readers

Get full text

Abstract

Reinforcement learning (RL) is a general paradigm for studying intelligent behaviour, with applications ranging from artificial intelligence to psychology and economics. AIXI is a universal solution to the RL problem; it can learn any computable environment. A technical subtlety of AIXI is that it is defined using a mixture over semimeasures that need not sum to 1, rather than over proper probability measures. In this work we argue that the shortfall of a semimeasure can naturally be interpreted as the agent’s estimate of the probability of its death. We formally define death for generally intelligent agents like AIXI, and prove a number of related theorems about their behaviour. Notable discoveries include that agent behaviour can change radically under positive linear transformations of the reward signal (from suicidal to dogmatically self-preserving), and that the agent’s posterior belief that it will survive increases over time.

Cite

CITATION STYLE

APA

Martin, J., Everitt, T., & Hutter, M. (2016). Death and suicide in universal artificial intelligence. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9782, pp. 23–32). Springer Verlag. https://doi.org/10.1007/978-3-319-41649-6_3

Death and suicide in universal artificial intelligence

Abstract

Cite

Register to see more suggestions