Deep Innovation Protection: Confronting the Credit Assignment Problem in Training Heterogeneous Neural Architectures

0Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.

Abstract

Deep reinforcement learning approaches have shown impressive results in a variety of different domains, however, more complex heterogeneous architectures such as world models require the different neural components to be trained separately instead of end-to-end. While a simple genetic algorithm recently showed end-to-end training is possible, it failed to solve a more complex 3D task. This paper presents a method called Deep Innovation Protection (DIP) that addresses the credit assignment problem in training complex heterogenous neural network models end-to-end for such environments. The main idea behind the approach is to employ multiobjective optimization to temporally reduce the selection pressure on specific components in multi-component network, allowing other components to adapt. We investigate the emergent representations of these evolved networks, which learn to predict properties important for the survival of the agent, without the need for a specific forward-prediction loss.

Cite

CITATION STYLE

APA

Risi, S., & Stanley, K. O. (2021). Deep Innovation Protection: Confronting the Credit Assignment Problem in Training Heterogeneous Neural Architectures. In 35th AAAI Conference on Artificial Intelligence, AAAI 2021 (Vol. 14A, pp. 12391–12399). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v35i14.17470

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free