Performance Optimization for Feature Extraction Section of DeepChem

Ke Zhan; Zhong Hua Lu; Yun Quan Zhang

Conference Proceedings

Performance Optimization for Feature Extraction Section of DeepChem

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2020) 12452 LNCS 290-304

DOI: 10.1007/978-3-030-60245-1_20

2Citations

2Readers

Get full text

Abstract

Based on the popular deep learning technique, the authors at Stanford implement DeepChem as an open source methods for the research in the fields of drug discovery, biology and so on. For the performance problem of training process of DeepChem neural network, this paper rebuilds the original serial feature extraction algorithm of DeepChem and optimizes the rebuilt serial algorithm based on the multiple processes algorithm. The experiment results show that the parallel algorithm achieves 15.38× speedup at the best compared with the serial algorithm. For the future work, first, in addition to the multiprocessing package, the other packages such as concurrent, subprocess and so on could be considered to optimize the feature extraction algorithm; Second, the serial and parallel algorithms run slower when the data block size is 150 compared with the other block sizes, optimization of training process performance for the smaller data block size is the second direction in future work.

Author supplied keywords

Cite

CITATION STYLE

APA

Zhan, K., Lu, Z. H., & Zhang, Y. Q. (2020). Performance Optimization for Feature Extraction Section of DeepChem. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12452 LNCS, pp. 290–304). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-60245-1_20

Performance Optimization for Feature Extraction Section of DeepChem

Abstract

Author supplied keywords

Cite

Register to see more suggestions