Hydration free energies from kernel-based machine learning: Compound-database bias

27Citations
Citations of this article
33Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We consider the prediction of a basic thermodynamic property - hydration free energies - across a large subset of the chemical space of small organic molecules. Our in silico study is based on computer simulations at the atomistic level with implicit solvent. We report on a kernel-based machine learning approach that is inspired by recent work in learning electronic properties but differs in key aspects: The representation is averaged over several conformers to account for the statistical ensemble. We also include an atomic-decomposition ansatz, which offers significant added transferability compared to molecular learning. Finally, we explore the existence of severe biases from databases of experimental compounds. By performing a combination of dimensionality reduction and cross-learning models, we show that the rate of learning depends significantly on the breadth and variety of the training dataset. Our study highlights the dangers of fitting machine-learning models to databases of a narrow chemical range.

References Powered by Scopus

Gromacs: High performance molecular simulations through multi-level parallelism from laptops to supercomputers

17045Citations
N/AReaders
Get full text

Electrostatics of nanosystems: Application to microtubules and the ribosome

6258Citations
N/AReaders
Get full text

G-mmpbsa -A GROMACS tool for high-throughput MM-PBSA calculations

3768Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Machine Learning for Chemical Reactions

258Citations
N/AReaders
Get full text

Improved prediction of solvation free energies by machine-learning polarizable continuum solvation model

68Citations
N/AReaders
Get full text

Machine learning of free energies in chemical compound space using ensemble representations: Reaching experimental uncertainty for solvation

44Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Rauer, C., & Bereau, T. (2020). Hydration free energies from kernel-based machine learning: Compound-database bias. Journal of Chemical Physics, 153(1). https://doi.org/10.1063/5.0012230

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 15

65%

Researcher 5

22%

Professor / Associate Prof. 2

9%

Lecturer / Post doc 1

4%

Readers' Discipline

Tooltip

Chemistry 10

56%

Computer Science 3

17%

Physics and Astronomy 3

17%

Materials Science 2

11%

Save time finding and organizing research with Mendeley

Sign up for free