Performance Optimization for Feature Extraction Section of DeepChem

2Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Based on the popular deep learning technique, the authors at Stanford implement DeepChem as an open source methods for the research in the fields of drug discovery, biology and so on. For the performance problem of training process of DeepChem neural network, this paper rebuilds the original serial feature extraction algorithm of DeepChem and optimizes the rebuilt serial algorithm based on the multiple processes algorithm. The experiment results show that the parallel algorithm achieves 15.38× speedup at the best compared with the serial algorithm. For the future work, first, in addition to the multiprocessing package, the other packages such as concurrent, subprocess and so on could be considered to optimize the feature extraction algorithm; Second, the serial and parallel algorithms run slower when the data block size is 150 compared with the other block sizes, optimization of training process performance for the smaller data block size is the second direction in future work.

Cite

CITATION STYLE

APA

Zhan, K., Lu, Z. H., & Zhang, Y. Q. (2020). Performance Optimization for Feature Extraction Section of DeepChem. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12452 LNCS, pp. 290–304). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-60245-1_20

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free