ReactionDataExtractor 2.0: A Deep Learning Approach for Data Extraction from Chemical Reaction Schemes

Abstract

Knowledge in the chemical domain is often disseminated graphically via chemical reaction schemes. Reaction schemes, which are composed of chemical diagrams and symbols, greatly simplify the task of describing chemical transformations. Although any chemist understands them intuitively, such drawings, like most graphical representations, are not easily understood by machines, which poses a challenge for data extraction. Currently available tools are limited in their scope of extraction and require manual preprocessing, which slows data extraction. We present a new tool, ReactionDataExtractor v2.0, which uses a combination of neural networks and symbolic artificial intelligence to effectively remove this barrier. We have evaluated our tool on a test set composed of reaction schemes taken from open-source journal articles and achieved F1 scores between 75% and 96%. These evaluation metrics can be further improved by tuning our object-detection models to a specific chemical subdomain, thanks to the data-driven approach that we have adopted, which relies on synthetically generated data. The system architecture of our tool is modular, which allows it to balance speed and accuracy to afford an autonomous, high-throughput solution for image-based chemical data extraction.
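
For readers unfamiliar with the F1 score quoted in the abstract, the following is a minimal sketch of how it is conventionally computed from detection counts. The function name and the example counts are hypothetical and are not taken from the paper or from the ReactionDataExtractor code base. The abstract does not say how predictions are matched to ground truth, so the sketch assumes that matching has already been done.

```python
def f1_score(true_positives: int, false_positives: int, false_negatives: int) -> float:
    """Return the F1 score, the harmonic mean of precision and recall.

    Hypothetical helper for illustration only; not part of the tool's API.
    """
    predicted = true_positives + false_positives  # everything the extractor reported
    actual = true_positives + false_negatives     # everything present in the ground truth
    if predicted == 0 or actual == 0:
        return 0.0  # convention assumed here: undefined precision or recall gives 0
    precision = true_positives / predicted
    recall = true_positives / actual
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Illustrative counts only (assumed, not reported in the paper).
example = f1_score(true_positives=90, false_positives=10, false_negatives=20)
print(f"F1 = {example:.3f}")
```

Because F1 is a harmonic mean, a low value for either precision or recall pulls the score down more than an arithmetic average would.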

Citation

Wilary, D. M., & Cole, J. M. (2023). ReactionDataExtractor 2.0: A Deep Learning Approach for Data Extraction from Chemical Reaction Schemes. Journal of Chemical Information and Modeling, 63(19), 6053–6067. https://doi.org/10.1021/acs.jcim.3c00422
