Distributed training large-scale deep architectures

Shang Xuan Zou; Chun Yen Chen; Jui Lin Wu; Chun Nan Chou; Chia Chin Tsao; Kuan Chieh Tung; Ting Wei Lin; Cheng Lung Sung; Edward Y. Chang

Conference Proceedings

Distributed training large-scale deep architectures

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10604 LNAI 18-32

DOI: 10.1007/978-3-319-69179-4_2

10Citations

33Readers

Get full text

Abstract

Scale of data and scale of computation infrastructures together enable the current deep learning renaissance. However, training large-scale deep architectures demands both algorithmic improvement and careful system configuration. In this paper, we focus on employing the system approach to speed up large-scale training. Taking both the algorithmic and system aspects into consideration, we develop a procedure for setting mini-batch size and choosing computation algorithms. We also derive lemmas for determining the quantity of key components such as the number of GPUs and parameter servers. Experiments and examples show that these guidelines help effectively speed up large-scale deep learning training.

Author supplied keywords

Cite

CITATION STYLE

APA

Zou, S. X., Chen, C. Y., Wu, J. L., Chou, C. N., Tsao, C. C., Tung, K. C., … Chang, E. Y. (2017). Distributed training large-scale deep architectures. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10604 LNAI, pp. 18–32). Springer Verlag. https://doi.org/10.1007/978-3-319-69179-4_2

Distributed training large-scale deep architectures

Abstract

Author supplied keywords

Cite

Register to see more suggestions