Hit Dexter: A Machine-Learning Model for the Prediction of Frequent Hitters

Conrad Stork; Johannes Wagner; Nils Ole Friedrich; Christina de Bruyn Kops; Martin Šícho; Johannes Kirchmair

Journal ArticleOPEN ACCESS

Hit Dexter: A Machine-Learning Model for the Prediction of Frequent Hitters

ChemMedChem (2018) 13(6) 564-571

DOI: 10.1002/cmdc.201700673

44Citations

68Readers

Abstract

False-positive assay readouts caused by badly behaving compounds—frequent hitters, pan-assay interference compounds (PAINS), aggregators, and others—continue to pose a major challenge to experimental screening. There are only a few in silico methods that allow the prediction of such problematic compounds. We report the development of Hit Dexter, two extremely randomized trees classifiers for the prediction of compounds likely to trigger positive assay readouts either by true promiscuity or by assay interference. The models were trained on a well-prepared dataset extracted from the PubChem Bioassay database, consisting of approximately 311 000 compounds tested for activity on at least 50 proteins. Hit Dexter reached MCC and AUC values of up to 0.67 and 0.96 on an independent test set, respectively. The models are expected to be of high value, in particular to medicinal chemists and biochemists who can use Hit Dexter to identify compounds for which extra caution should be exercised with positive assay readouts. Hit Dexter is available as a free web service at http://hitdexter.zbh. uni-hamburg.de.

Author supplied keywords

Cite

CITATION STYLE

APA

Stork, C., Wagner, J., Friedrich, N. O., de Bruyn Kops, C., Šícho, M., & Kirchmair, J. (2018). Hit Dexter: A Machine-Learning Model for the Prediction of Frequent Hitters. ChemMedChem, 13(6), 564–571. https://doi.org/10.1002/cmdc.201700673

Hit Dexter: A Machine-Learning Model for the Prediction of Frequent Hitters

Abstract

Author supplied keywords

Cite

Register to see more suggestions