Data Curation can Improve the Prediction Accuracy of Metabolic Intrinsic Clearance

Tsuyoshi Esaki; Reiko Watanabe; Hitoshi Kawashima; Rikiya Ohashi; Yayoi Natsume-Kitatani; Chioko Nagao; Kenji Mizuguchi

Journal Article, Open Access


Molecular Informatics (2019) 38(1)

DOI: 10.1002/minf.201800086


Abstract

In vitro metabolic stability, often measured in human liver microsomes, is a key consideration at the screening stages of drug discovery. Computational prediction models can be built from the large quantity of experimental data available in public databases, but these databases typically contain data measured using various protocols in different laboratories, which raises concerns about data quality. In this study, we retrieved intrinsic clearance (CLint) measurements from an open database and performed extensive manual curation. Chemical descriptors were then calculated using freely available software, and prediction models were built using machine learning algorithms. The models trained on the curated data performed better than those trained on the non-curated data and achieved performance comparable to previously published models, demonstrating the importance of manual curation in data preparation. The curated data have been made available so that our models are fully reproducible.
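To make the data-quality issue concrete, the following minimal sketch shows one automated check that could precede manual review when the same compound has CLint values reported by several laboratories. It is not the authors' curation procedure, which the abstract describes as manual. The function name `consolidate_clint`, the `max_log_spread` threshold of 0.5 log10 units, and the example records are all hypothetical and chosen only for illustration.

```python
import math
from collections import defaultdict
from statistics import median


def consolidate_clint(records, max_log_spread=0.5):
    """Merge replicate CLint measurements per compound (illustrative only).

    records: iterable of (compound_id, clint) pairs.
    Missing or non-positive values are skipped. Compounds whose replicates
    span more than max_log_spread log10 units are dropped as inconsistent.
    The remaining compounds get the median of their log10 values,
    converted back to the original scale.
    """
    by_compound = defaultdict(list)
    for compound_id, clint in records:
        if clint is None or clint <= 0:
            continue  # skip missing or non-physical values
        by_compound[compound_id].append(math.log10(clint))

    consolidated = {}
    for compound_id, logs in by_compound.items():
        if max(logs) - min(logs) > max_log_spread:
            continue  # replicates disagree too much to trust a single value
        consolidated[compound_id] = 10 ** median(logs)
    return consolidated


# Hypothetical records: (compound identifier, CLint value in arbitrary units)
records = [
    ("A", 10.0), ("A", 20.0),   # replicates agree within the threshold
    ("B", 5.0), ("B", 500.0),   # replicates differ by two log units
    ("C", 8.0), ("C", None),    # one usable value, one missing
    ("D", -1.0),                # non-physical value only
]
result = consolidate_clint(records)
```

Working on the log10 scale is a design choice made here because clearance values commonly span several orders of magnitude; a flagged compound such as "B" would be a candidate for manual inspection of the underlying assay protocols rather than automatic deletion.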


Esaki, T., Watanabe, R., Kawashima, H., Ohashi, R., Natsume-Kitatani, Y., Nagao, C., & Mizuguchi, K. (2019). Data Curation can Improve the Prediction Accuracy of Metabolic Intrinsic Clearance. Molecular Informatics, 38(1). https://doi.org/10.1002/minf.201800086
