Restricted Boltzmann Machines (RBMs) are unsupervised probabilistic neural networks that can be stacked to form Deep Belief Networks. Given the recent popularity of RBMs and the increasing availability of parallel computing architectures, it is worth investigating learning algorithms for RBMs that benefit from parallel computation. In this paper, we look at two extensions of the parallel tempering algorithm, a Markov Chain Monte Carlo method for approximating the likelihood gradient. The first extension aims at a more effective exchange of information among the parallel sampling chains. The second extension estimates gradients by averaging over chains at different temperatures. We investigate the efficiency of the proposed methods and demonstrate their usefulness on the MNIST dataset. The weighted averaging in particular appears to benefit Maximum Likelihood learning. © 2012 Springer-Verlag.
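As background for the abstract, the sketch below illustrates standard parallel tempering for RBM sampling: several Gibbs chains run at different inverse temperatures, and adjacent chains occasionally swap states via a Metropolis test. This is a minimal illustration of the baseline method the paper extends, not the paper's multi-tempering algorithm; all function and variable names here are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def energy(v, h, W, b, c):
    # RBM energy: E(v, h) = -v·b - h·c - v·W·h
    return -v @ b - h @ c - v @ W @ h

def gibbs_step(v, W, b, c, beta):
    # One tempered Gibbs sweep at inverse temperature beta:
    # sample h given v, then v given h, with energies scaled by beta.
    h = (rng.random(c.shape) < sigmoid(beta * (v @ W + c))).astype(float)
    v = (rng.random(b.shape) < sigmoid(beta * (h @ W.T + b))).astype(float)
    return v, h

def swap_step(states, betas, W, b, c):
    # Metropolis swaps between adjacent temperatures: accept with
    # probability min(1, exp((beta_i - beta_j) * (E_i - E_j))).
    for i in range(len(betas) - 1):
        (v1, h1), (v2, h2) = states[i], states[i + 1]
        log_ratio = (betas[i] - betas[i + 1]) * (
            energy(v1, h1, W, b, c) - energy(v2, h2, W, b, c)
        )
        if np.log(rng.random()) < log_ratio:
            states[i], states[i + 1] = states[i + 1], states[i]
    return states
```

In maximum-likelihood training, samples from the chain at beta = 1 would supply the negative phase of the gradient; the paper's second extension instead combines information from chains at several temperatures.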
CITATION STYLE
Brakel, P., Dieleman, S., & Schrauwen, B. (2012). Training restricted boltzmann machines with multi-tempering: Harnessing parallelization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7553 LNCS, pp. 92–99). https://doi.org/10.1007/978-3-642-33266-1_12