Abstract
Traditional machine learning (ML) metrics overestimate model performance for materials discovery. We introduce (1) leave-one-cluster-out cross-validation (LOCO CV) and (2) a simple nearest-neighbor benchmark to show that model performance in discovery applications strongly depends on the problem, data sampling, and extrapolation. Our results suggest that ML-guided iterative experimentation may outperform standard high-throughput screening for discovering breakthrough materials like high-Tc superconductors with ML.
Cite
CITATION STYLE
Meredig, B., Antono, E., Church, C., Hutchinson, M., Ling, J., Paradiso, S., … Ward, L. (2018). Can machine learning identify the next high-temperature superconductor? Examining extrapolation performance for materials discovery. Molecular Systems Design and Engineering, 3(5), 819–825. https://doi.org/10.1039/c8me00012c
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.