Towards overcoming data scarcity in materials science: unifying models and datasets with a mixture of experts framework

36Citations
Citations of this article
58Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

While machine learning has emerged in recent years as a useful tool for the rapid prediction of materials properties, generating sufficient data to reliably train models without overfitting is often impractical. Towards overcoming this limitation, we present a general framework for leveraging complementary information across different models and datasets for accurate prediction of data-scarce materials properties. Our approach, based on a machine learning paradigm called mixture of experts, outperforms pairwise transfer learning on 14 of 19 materials property regression tasks, performing comparably on four of the remaining five. The approach is interpretable, model-agnostic, and scalable to combining an arbitrary number of pre-trained models and datasets to any downstream property prediction task. We anticipate the performance of our framework will further improve as better model architectures, new pre-training tasks, and larger materials datasets are developed by the community.

Cite

CITATION STYLE

APA

Chang, R., Wang, Y. X., & Ertekin, E. (2022). Towards overcoming data scarcity in materials science: unifying models and datasets with a mixture of experts framework. Npj Computational Materials, 8(1). https://doi.org/10.1038/s41524-022-00929-x

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free