Limited evaluation evolutionary optimization of large neural networks


Abstract

Stochastic gradient descent is the most prevalent algorithm for training neural networks. However, other approaches, such as evolutionary algorithms, are also applicable to this task. Evolutionary algorithms bring unique trade-offs that are worth exploring, but computational demands have so far restricted exploration to small networks with few parameters. We implement an evolutionary algorithm that executes entirely on the GPU, which allows us to efficiently batch-evaluate a whole population of networks. Within this framework, we explore the limited evaluation evolutionary algorithm for neural network training and find that its batch evaluation idea comes with a large accuracy trade-off. In further experiments, we explore crossover operators and find that unprincipled random uniform crossover performs extremely well. Finally, we train a network with 92k parameters on MNIST using an EA and achieve 97.6% test accuracy, compared to 98% test accuracy for the same network trained with Adam. Code is available at https://github.com/jprellberg/gpuea.
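To make the batch-evaluation and crossover ideas concrete, the following is a minimal sketch, not the authors' implementation from the linked repository. It assumes PyTorch, a toy two-layer MLP, and illustrative population and layer sizes: the whole population's parameters are stacked along a leading axis so that one einsum per layer evaluates every network on the same data batch, and random uniform crossover picks each gene from either parent with probability 0.5.

```python
import torch

# Hedged sketch of GPU batch evaluation of a population of small MLPs
# plus random uniform crossover. All sizes are illustrative.
P, B, D_IN, D_HID, D_OUT = 64, 128, 784, 32, 10   # population, batch, layer sizes
device = "cuda" if torch.cuda.is_available() else "cpu"

# Each individual's parameters are one slice along the leading population axis.
W1 = torch.randn(P, D_IN, D_HID, device=device) * 0.05
b1 = torch.zeros(P, D_HID, device=device)
W2 = torch.randn(P, D_HID, D_OUT, device=device) * 0.05
b2 = torch.zeros(P, D_OUT, device=device)

def forward_population(x):
    """Evaluate all P networks on the same data batch x of shape (B, D_IN)."""
    h = torch.relu(torch.einsum("bi,pij->pbj", x, W1) + b1[:, None, :])
    return torch.einsum("pbh,pho->pbo", h, W2) + b2[:, None, :]   # (P, B, D_OUT)

def fitness(x, y):
    """Negative mean cross-entropy per individual, so higher is better."""
    logp = torch.log_softmax(forward_population(x), dim=-1)       # (P, B, D_OUT)
    nll = -logp.gather(-1, y.expand(P, -1)[..., None]).squeeze(-1).mean(dim=1)
    return -nll                                                   # (P,)

def uniform_crossover(parent_a, parent_b):
    """Random uniform crossover: each gene comes from either parent with p=0.5."""
    mask = torch.rand_like(parent_a) < 0.5
    return torch.where(mask, parent_a, parent_b)

# Example: score the population on one data batch and recombine two parents.
x = torch.randn(B, D_IN, device=device)
y = torch.randint(0, D_OUT, (B,), device=device)
print(fitness(x, y).shape)                 # torch.Size([64])
child_W1 = uniform_crossover(W1[0], W1[1])
```

Evaluating on a single small data batch per generation, as above, is what keeps the approach cheap and is also the source of the accuracy trade-off the abstract mentions: fitness estimates become noisy when each individual sees only a limited number of examples.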

Citation (APA)

Prellberg, J., & Kramer, O. (2018). Limited evaluation evolutionary optimization of large neural networks. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11117 LNAI, pp. 270–283). Springer Verlag. https://doi.org/10.1007/978-3-030-00111-7_23
