The HSIC bottleneck: Deep learning without back-propagation

Abstract

We introduce the HSIC (Hilbert-Schmidt independence criterion) bottleneck for training deep neural networks. The HSIC bottleneck is an alternative to the conventional cross-entropy loss and backpropagation that has a number of distinct advantages. It mitigates exploding and vanishing gradients, resulting in the ability to learn very deep networks without skip connections. There is no requirement for symmetric feedback or update locking. We find that the HSIC bottleneck provides performance on MNIST/FashionMNIST/CIFAR10 classification comparable to backpropagation with a cross-entropy target, even when the system is not encouraged to make the output resemble the classification labels. Appending a single layer trained with SGD (without backpropagation) to reformat the information further improves performance.
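
To make the idea concrete, below is a minimal sketch of an HSIC-bottleneck-style per-layer loss in PyTorch. It assumes a Gaussian (RBF) kernel with a hand-picked bandwidth sigma, the biased empirical HSIC estimator, and a balance weight beta; the paper's exact formulation (for example its normalized HSIC variant and its kernel and bandwidth choices) may differ.

    import torch

    def gaussian_gram(x, sigma=5.0):
        # Gaussian (RBF) kernel Gram matrix from pairwise squared distances.
        # sigma is an assumed, hand-picked bandwidth.
        d2 = torch.cdist(x, x) ** 2
        return torch.exp(-d2 / (2 * sigma ** 2))

    def hsic(kx, ky):
        # Biased empirical HSIC: tr(Kx H Ky H) / (m - 1)^2,
        # where H = I - (1/m) 1 1^T centers the Gram matrices.
        m = kx.shape[0]
        h = torch.eye(m, device=kx.device) - torch.ones(m, m, device=kx.device) / m
        return torch.trace(kx @ h @ ky @ h) / (m - 1) ** 2

    def hsic_bottleneck_loss(z, x, y, beta=100.0):
        # Per-layer objective: push the hidden activations z toward
        # independence from the input x while keeping them dependent on
        # the labels y (x and z flattened to 2-D, y one-hot encoded).
        # beta is an assumed trade-off weight.
        kz, kx, ky = gaussian_gram(z), gaussian_gram(x), gaussian_gram(y)
        return hsic(kz, kx) - beta * hsic(kz, ky)

Because each layer minimizes its own loss on its own activations, gradients never cross layer boundaries: there is no backward pass, no symmetric feedback weights, and no update locking, which is what lets very deep networks train without exploding or vanishing gradients.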

Citation (APA)

Ma, W.-D. K., Lewis, J. P., & Kleijn, W. B. (2020). The HSIC bottleneck: Deep learning without back-propagation. In Proceedings of the 34th AAAI Conference on Artificial Intelligence (pp. 5085–5092). AAAI Press. https://doi.org/10.1609/aaai.v34i04.5950
