Robustly interrogating machine learning-based scoring functions: what are they learning?

Guy Durant; Fergus Boyles; Kristian Birchall; Brian Marsden; Charlotte M. Deane

Journal Article (Open Access)

Bioinformatics (2025) 41(2)

DOI: 10.1093/bioinformatics/btaf040

11 Citations

36 Readers

Abstract

Motivation: Machine learning-based scoring functions (MLBSFs) have been found to exhibit inconsistent performance on different benchmarks and to be prone to learning dataset bias. For the field to develop MLBSFs that learn a generalizable understanding of physics, a more rigorous understanding of how they perform is required.

Results: In this work, we compared the performance of a diverse set of popular MLBSFs (RFScore, SIGN, OnionNet-2, Pafnucy, and PointVS) on a range of benchmarks with that of our proposed baseline models, which can only learn dataset biases. We found that these baseline models were competitive in accuracy with the MLBSFs on almost all proposed benchmarks, indicating that the MLBSFs learn only dataset biases. Our tests and the accompanying platform, ToolBoxSF, will enable researchers to robustly interrogate MLBSF performance and determine the effect of dataset biases on their predictions.

Availability and implementation: https://github.com/guydurant/toolboxsf.

Cite

Durant, G., Boyles, F., Birchall, K., Marsden, B., & Deane, C. M. (2025). Robustly interrogating machine learning-based scoring functions: what are they learning? Bioinformatics, 41(2). https://doi.org/10.1093/bioinformatics/btaf040
