Improving chemical reaction yield prediction using pre-trained graph neural networks

Jongmin Han; Youngchun Kwon; Youn Suk Choi; Seokho Kang

Journal ArticleOPEN ACCESS

Improving chemical reaction yield prediction using pre-trained graph neural networks

Journal of Cheminformatics (2024) 16(1)

DOI: 10.1186/s13321-024-00818-z

19Citations

26Readers

Abstract

Graph neural networks (GNNs) have proven to be effective in the prediction of chemical reaction yields. However, their performance tends to deteriorate when they are trained using an insufficient training dataset in terms of quantity or diversity. A promising solution to alleviate this issue is to pre-train a GNN on a large-scale molecular database. In this study, we investigate the effectiveness of GNN pre-training in chemical reaction yield prediction. We present a novel GNN pre-training method for performance improvement.Given a molecular database consisting of a large number of molecules, we calculate molecular descriptors for each molecule and reduce the dimensionality of these descriptors by applying principal component analysis. We define a pre-text task by assigning a vector of principal component scores as the pseudo-label to each molecule in the database. A GNN is then pre-trained to perform the pre-text task of predicting the pseudo-label for the input molecule. For chemical reaction yield prediction, a prediction model is initialized using the pre-trained GNN and then fine-tuned with the training dataset containing chemical reactions and their yields. We demonstrate the effectiveness of the proposed method through experimental evaluation on benchmark datasets.

Author supplied keywords

Cite

CITATION STYLE

APA

Han, J., Kwon, Y., Choi, Y. S., & Kang, S. (2024). Improving chemical reaction yield prediction using pre-trained graph neural networks. Journal of Cheminformatics, 16(1). https://doi.org/10.1186/s13321-024-00818-z

Improving chemical reaction yield prediction using pre-trained graph neural networks

Abstract

Author supplied keywords

Cite

Register to see more suggestions