Neural network initialization

Abstract

Proper initialization is one of the most important prerequisites for fast convergence of feed-forward neural networks like high order and multilayer perceptrons. This publication aims at determining the optimal value of the initial weight variance (or range), which is the principal parameter of random weight initialization methods for both types of neural networks. An overview of random weight initialization methods for multilayer perceptrons is presented. These methods are extensively tested using eight real-world benchmark data sets and a broad range of initial weight variances, by means of more than 30,000 simulations, with the aim of finding the best weight initialization method for multilayer perceptrons. For high order networks, a large number of experiments (more than 200,000 simulations) was performed, using three weight distributions, three activation functions, several network orders, and the same eight data sets. The results of these experiments are compared to weight initialization techniques for multilayer perceptrons, which leads to the proposal of a suitable weight initialization method for high order perceptrons. The conclusions on the weight initialization methods for both types of networks are justified by sufficiently small confidence intervals of the mean convergence times.
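The abstract treats the initial weight variance (or, for a uniform distribution, the range) as the principal parameter of random weight initialization. The following minimal sketch illustrates that idea; the function name, default variance, and the choice of normal vs. uniform distribution are illustrative assumptions, not the paper's specific method.

```python
import numpy as np

def init_weights(fan_in, fan_out, sigma=0.1, dist="normal", rng=None):
    """Randomly initialize a (fan_in x fan_out) weight matrix.

    sigma is the standard deviation of the weights -- the principal
    parameter studied in the abstract. The default value 0.1 is
    purely illustrative, not a recommendation from the paper.
    """
    rng = np.random.default_rng() if rng is None else rng
    if dist == "normal":
        return rng.normal(0.0, sigma, size=(fan_in, fan_out))
    if dist == "uniform":
        # A uniform distribution on [-r, r] has variance r^2 / 3,
        # so r = sigma * sqrt(3) gives the same variance as above.
        r = sigma * np.sqrt(3.0)
        return rng.uniform(-r, r, size=(fan_in, fan_out))
    raise ValueError(f"unknown distribution: {dist}")
```

The uniform case shows how a "range" parameter and a "variance" parameter are interchangeable: fixing the variance determines the range and vice versa, which is why the abstract mentions them together.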

Citation (APA)

Thimm, G., & Fiesler, E. (1995). Neural network initialization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 930, pp. 535–542). Springer Verlag. https://doi.org/10.1007/3-540-59497-3_220
