Predictive Chemistry Augmented with Text Retrieval

4Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.

Abstract

This paper focuses on using natural language descriptions to enhance predictive models in the chemistry field. Conventionally, chemoinformatics models are trained with extensive structured data manually extracted from the literature. In this paper, we introduce TextReact, a novel method that directly augments predictive chemistry with texts retrieved from the literature. TextReact retrieves text descriptions relevant for a given chemical reaction, and then aligns them with the molecular representation of the reaction. This alignment is enhanced via an auxiliary masked LM objective incorporated in the predictor training. We empirically validate the framework on two chemistry tasks: reaction condition recommendation and one-step retrosynthesis. By leveraging text retrieval, TextReact significantly outperforms state-of-the-art chemoinformatics models trained solely on molecular data.

Cite

CITATION STYLE

APA

Qian, Y., Li, Z., Tu, Z., Coley, C. W., & Barzilay, R. (2023). Predictive Chemistry Augmented with Text Retrieval. In EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 12731–12745). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.emnlp-main.784

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free