Abstract
Understanding and predicting the properties of inorganic materials is crucial for accelerating advancements in materials science and driving applications in energy, electronics and beyond. Integrating material structure data with language-based information through multimodal large language models (LLMs) offers great potential to support these efforts by enhancing human–artificial intelligence interaction. However, a key challenge lies in integrating atomic structures at full resolution into LLMs. In this work, we introduce MatterChat, a versatile structure-aware multimodal LLM that unifies material structural data and textual inputs into a single cohesive model. MatterChat uses a bridging module to effectively align a pretrained universal machine learning interatomic potential with a pretrained LLM, reducing training costs and enhancing flexibility. Our results demonstrate that MatterChat greatly improves performance in material property prediction and human–artificial intelligence interaction, surpassing general-purpose LLMs such as GPT-4. We also demonstrate its usefulness in applications such as more advanced scientific reasoning and step-by-step material synthesis.
Cite
CITATION STYLE
Tang, Y., Xu, W., Cao, J., Gao, W., Farrell, S., Erichson, B., … Yao, Z. J. (2026). A multimodal large language model for materials science. Nature Machine Intelligence, 8(4), 588–601. https://doi.org/10.1038/s42256-026-01214-y
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.