Distributed B-SDLM: Accelerating the training convergence of deep neural networks through parallelism

Abstract

This paper proposes an efficient asynchronous stochastic second-order learning algorithm for distributed training of neural networks (NNs). The proposed algorithm, named distributed bounded stochastic diagonal Levenberg-Marquardt (distributed B-SDLM), is based on the B-SDLM algorithm, which converges quickly and requires only minimal computational overhead compared with the stochastic gradient descent (SGD) method. The algorithm is implemented using the parameter server thread model in the MPICH implementation. Experiments on the MNIST dataset show that training with distributed B-SDLM on a 16-core CPU cluster allows the convolutional neural network (CNN) model to converge quickly, with speedups of 6.03× and 12.28× to reach training and testing loss values of 0.01 and 0.08, respectively. It also reaches target classification accuracies in significantly less time (5.67× and 8.72× faster to reach 99% training and 98% testing accuracy on MNIST, respectively).
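The core of B-SDLM is the stochastic diagonal Levenberg-Marquardt update, in which each parameter gets its own learning rate derived from a running estimate of the diagonal of the Hessian, kept within bounds for stability. The sketch below illustrates that idea on a toy least-squares problem; it is a minimal illustration, not the paper's implementation, and the function names, the clipping bound lr_max, the smoothing constant, and the toy problem are all illustrative assumptions.

import numpy as np

def b_sdlm_step(w, grad, h_diag, eta=0.01, mu=0.1, lr_max=0.1):
    # Per-parameter learning rates eta / (h_diag + mu) follow the stochastic
    # diagonal Levenberg-Marquardt rule; clipping at lr_max stands in for the
    # "bounded" part of B-SDLM. All constants here are illustrative.
    lr = np.minimum(eta / (h_diag + mu), lr_max)
    return w - lr * grad

# Toy usage: fit a linear model y = X @ w with stochastic samples.
rng = np.random.default_rng(0)
X = rng.normal(size=(256, 10))
w_true = rng.normal(size=10)
y = X @ w_true

w = np.zeros(10)
h_diag = np.zeros(10)                       # running diagonal curvature estimate
for step in range(500):
    i = rng.integers(0, X.shape[0])         # pick one stochastic sample
    err = X[i] @ w - y[i]
    grad = err * X[i]                       # gradient of 0.5 * err**2
    # Gauss-Newton diagonal approximation of the Hessian, smoothed over time.
    h_diag = 0.95 * h_diag + 0.05 * X[i] ** 2
    w = b_sdlm_step(w, grad, h_diag)

In the distributed setting described in the paper, workers would compute such gradient and curvature statistics asynchronously and a parameter server would apply the updates; the single-threaded loop above only shows the per-parameter update rule itself.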

Citation
Liew, S. S., Khalil-Hani, M., & Bakhteri, R. (2016). Distributed B-SDLM: Accelerating the training convergence of deep neural networks through parallelism. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9810 LNCS, pp. 243–250). Springer Verlag. https://doi.org/10.1007/978-3-319-42911-3_20
