Abstract
Utilizing recently introduced concepts from statistics and quantitative risk management, we present a general variant of Batch Normalization (BN) that offers accelerated convergence of Neural Network training compared to conventional BN. In general, we show that mean and standard deviation are not always the most appropriate choice for the centering and scaling procedure within the BN transformation, particularly if ReLU follows the normalization step. We present a Generalized Batch Normalization (GBN) transformation, which can utilize a variety of alternative deviation measures for scaling and statistics for centering, choices which naturally arise from the theory of generalized deviation measures and risk theory in general. When used in conjunction with the ReLU non-linearity, the underlying risk theory suggests natural, arguably optimal choices for the deviation measure and statistic. Utilizing the suggested deviation measure and statistic, we show experimentally that training is accelerated more so than with conventional BN, often with improved error rate as well. Overall, we propose a more flexible BN transformation supported by a complimentary theoretical framework that can potentially guide design choices. Consider a single layer of the network which first receives output from the previous layer h and then applies an affine transformation to get x = Wh + b, followed by an element-wise non-linearity to produce output g(x) which is fed to the next layer. Let x = [x1, x2, · · ·, xn]T denote its individual components so that we can write g(x) = [g(x1), g(x2),..., g(xn)]T,.
Cite
CITATION STYLE
Yuan, X., Feng, Z., Norton, M., & Li, X. (2019). Generalized batch normalization: Towards accelerating deep neural networks. In 33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Innovative Applications of Artificial Intelligence Conference, IAAI 2019 and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019 (pp. 1682–1689). AAAI Press. https://doi.org/10.1609/aaai.v33i01.33011682
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.