Survey on software tools that implement deep learning algorithms on intel/x86 and IBM/Power8/Power9 platforms

7Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.

Abstract

Neural networks are becoming more and more popular in scientific field and in the industry. It is mostly because new solutions using neural networks show state-of-the-art results in the domains previously occupied by traditional methods, eg. computer vision, speech recognition etc. But to get these results neural networks become progressively more complex, thus needing a lot more training. The training of neural networks today can take weeks. This problems can be solved by parallelization of the neural networks training and using modern clusters and supercomputers, which can significantly reduce the learning time. Today, a faster training for data scientist is essential, because it allows to get the results faster to make the next decision. In this paper we provide an overview of distributed learning provided by the popular modern deep learning frameworks, both in terms of provided functionality and performance. We consider multiple hardware choices: training on multiple GPUs and multiple computing nodes.

References Powered by Scopus

ImageNet classification with deep convolutional neural networks

23208Citations
N/AReaders
Get full text

Scalable distributed DNN training using commodity GPU cloud computing

404Citations
N/AReaders
Get full text

A historical perspective of speech recognition

122Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Application of Graphics Processing Units for Self-Consistent Modelling of Shallow Water Dynamics and Sediment Transport

19Citations
N/AReaders
Get full text

Automation of the process of selecting hyperparameters for artificial neural networks for processing retrospective text information

12Citations
N/AReaders
Get full text

Brain tumor classification & segmentation by using advanced DNN, CNN & ResNet-50 neural networks

2Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Shaikhislamov, D., Sozykin, A., & Voevodin, V. (2019). Survey on software tools that implement deep learning algorithms on intel/x86 and IBM/Power8/Power9 platforms. Supercomputing Frontiers and Innovations, 6(4), 57–83. https://doi.org/10.14529/jsfi190404

Readers over time

‘20‘21‘22‘23‘2400.751.52.253

Readers' Seniority

Tooltip

Professor / Associate Prof. 1

33%

PhD / Post grad / Masters / Doc 1

33%

Researcher 1

33%

Readers' Discipline

Tooltip

Computer Science 2

67%

Physics and Astronomy 1

33%

Save time finding and organizing research with Mendeley

Sign up for free
0