Reinforcement Learning of Causal Variables Using Mediation Analysis

Tue Herlau; Rasmus Larsen

Conference ProceedingsOPEN ACCESS

Reinforcement Learning of Causal Variables Using Mediation Analysis

Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022 (2022) 36 6910-6917

DOI: 10.1609/aaai.v36i6.20648

7Citations

12Readers

Abstract

We consider the problem of acquiring causal representations and concepts in a reinforcement learning setting. Our approach defines a causal variable as being both manipulable by a policy, and able to predict the outcome. We thereby obtain a parsimonious causal graph in which interventions occur at the level of policies. The approach avoids defining a generative model of the data, prior pre-processing, or learning the transition kernel of the Markov decision process. Instead, causal variables and policies are determined by maximizing a new optimization target inspired by mediation analysis, which differs from the expected return. The maximization is accomplished using a generalization of Bellman’s equation which is shown to converge, and the method finds meaningful causal representations in a simulated environment.

Cite

CITATION STYLE

APA

Herlau, T., & Larsen, R. (2022). Reinforcement Learning of Causal Variables Using Mediation Analysis. In Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022 (Vol. 36, pp. 6910–6917). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v36i6.20648

Reinforcement Learning of Causal Variables Using Mediation Analysis

Abstract

Cite

Register to see more suggestions