Deep Innovation Protection: Confronting the Credit Assignment Problem in Training Heterogeneous Neural Architectures

Sebastian Risi; Kenneth O. Stanley

Conference ProceedingsOPEN ACCESS

Deep Innovation Protection: Confronting the Credit Assignment Problem in Training Heterogeneous Neural Architectures

35th AAAI Conference on Artificial Intelligence, AAAI 2021 (2021) 14A 12391-12399

DOI: 10.1609/aaai.v35i14.17470

0Citations

10Readers

Abstract

Deep reinforcement learning approaches have shown impressive results in a variety of different domains, however, more complex heterogeneous architectures such as world models require the different neural components to be trained separately instead of end-to-end. While a simple genetic algorithm recently showed end-to-end training is possible, it failed to solve a more complex 3D task. This paper presents a method called Deep Innovation Protection (DIP) that addresses the credit assignment problem in training complex heterogenous neural network models end-to-end for such environments. The main idea behind the approach is to employ multiobjective optimization to temporally reduce the selection pressure on specific components in multi-component network, allowing other components to adapt. We investigate the emergent representations of these evolved networks, which learn to predict properties important for the survival of the agent, without the need for a specific forward-prediction loss.

Cite

CITATION STYLE

APA

Risi, S., & Stanley, K. O. (2021). Deep Innovation Protection: Confronting the Credit Assignment Problem in Training Heterogeneous Neural Architectures. In 35th AAAI Conference on Artificial Intelligence, AAAI 2021 (Vol. 14A, pp. 12391–12399). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v35i14.17470

Deep Innovation Protection: Confronting the Credit Assignment Problem in Training Heterogeneous Neural Architectures

Abstract

Cite

Register to see more suggestions