Abstract
Motivation: The acid dissociation constant (pKa) is a critical parameter to reflect the ionization ability of chemical compounds and is widely applied in a variety of industries. However, the experimental determination of pKa is intricate and time-consuming, especially for the exact determination of micro-pKa information at the atomic level. Hence, a fast and accurate prediction of pKa values of chemical compounds is of broad interest. Results: Here, we compiled a large-scale pKa dataset containing 16 595 compounds with 17 489 pKa values. Based on this dataset, a novel pKa prediction model, named Graph-pKa, was established using graph neural networks. Graph-pKa performed well on the prediction of macro-pKa values, with a mean absolute error around 0.55 and a coefficient of determination around 0.92 on the test dataset. Furthermore, combining multi-instance learning, Graph-pKa was also able to automatically deconvolute the predicted macro-pKa into discrete micro-pKa values.
Cite
CITATION STYLE
Xiong, J., Li, Z., Wan, G., Fu, Z., Zhong, F., Xu, T., … Zheng, M. (2022). Multi-instance learning of graph neural networks for aqueous pKa prediction. Bioinformatics, 38(3), 792–798. https://doi.org/10.1093/bioinformatics/btab714
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.