Mode-assisted unsupervised learning of restricted Boltzmann machines

12Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.

Abstract

Restricted Boltzmann machines (RBMs) are a powerful class of generative models, but their training requires computing a gradient that, unlike supervised backpropagation on typical loss functions, is notoriously difficult even to approximate. Here, we show that properly combining standard gradient updates with an off-gradient direction, constructed from samples of the RBM ground state (mode), improves training dramatically over traditional gradient methods. This approach, which we call ‘mode-assisted training’, promotes faster training and stability, in addition to lower converged relative entropy (KL divergence). We demonstrate its efficacy on synthetic datasets where we can compute KL divergences exactly, as well as on a larger machine learning standard (MNIST). The proposed mode-assisted training can be applied in conjunction with any given gradient method, and is easily extended to more general energy-based neural network structures such as deep, convolutional and unrestricted Boltzmann machines.

Cite

CITATION STYLE

APA

Manukian, H., Pei, Y. R., Bearden, S. R. B., & Di Ventra, M. (2020). Mode-assisted unsupervised learning of restricted Boltzmann machines. Communications Physics, 3(1). https://doi.org/10.1038/s42005-020-0373-8

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free