Data efficiency and extrapolation trends in neural network interatomic potentials

Joshua A. Vita; Daniel Schwalbe-Koda

Journal Article (Open Access)

Machine Learning: Science and Technology (2023) 4(3)

DOI: 10.1088/2632-2153/acf115

7 Citations

22 Readers

Abstract

Recently, key architectural advances have been proposed for neural network interatomic potentials (NNIPs), such as incorporating message-passing networks, equivariance, or many-body expansion terms. Although modern NNIP models exhibit small differences in test accuracy, this metric is still considered the main target when developing new NNIP architectures. In this work, we show how architectural and optimization choices influence the generalization of NNIPs, revealing trends in molecular dynamics (MD) stability, data efficiency, and loss landscapes. Using the 3BPA dataset, we uncover trends in NNIP errors and robustness to noise, showing these metrics are insufficient to predict MD stability in the high-accuracy regime. With a large-scale study on NequIP, MACE, and their optimizers, we show that our metric of loss entropy predicts out-of-distribution error and data efficiency despite being computed only on the training set. This work provides a deep learning justification for probing extrapolation and can inform the development of next-generation NNIPs.

Cite (APA)

Vita, J. A., & Schwalbe-Koda, D. (2023). Data efficiency and extrapolation trends in neural network interatomic potentials. Machine Learning: Science and Technology, 4(3). https://doi.org/10.1088/2632-2153/acf115
