Molecular optimization by capturing chemist’s intuition using deep neural networks

63Citations
Citations of this article
139Readers
Mendeley users who have this article in their library.
Get full text

Abstract

A main challenge in drug discovery is finding molecules with a desirable balance of multiple properties. Here, we focus on the task of molecular optimization, where the goal is to optimize a given starting molecule towards desirable properties. This task can be framed as a machine translation problem in natural language processing, where in our case, a molecule is translated into a molecule with optimized properties based on the SMILES representation. Typically, chemists would use their intuition to suggest chemical transformations for the starting molecule being optimized. A widely used strategy is the concept of matched molecular pairs where two molecules differ by a single transformation. We seek to capture the chemist’s intuition from matched molecular pairs using machine translation models. Specifically, the sequence-to-sequence model with attention mechanism, and the Transformer model are employed to generate molecules with desirable properties. As a proof of concept, three ADMET properties are optimized simultaneously: logD, solubility, and clearance, which are important properties of a drug. Since desirable properties often vary from project to project, the user-specified desirable property changes are incorporated into the input as an additional condition together with the starting molecules being optimized. Thus, the models can be guided to generate molecules satisfying the desirable properties. Additionally, we compare the two machine translation models based on the SMILES representation, with a graph-to-graph translation model HierG2G, which has shown the state-of-the-art performance in molecular optimization. Our results show that the Transformer can generate more molecules with desirable properties by making small modifications to the given starting molecules, which can be intuitive to chemists. A further enrichment of diverse molecules can be achieved by using an ensemble of models.

References Powered by Scopus

SMILES, a Chemical Language and Information System: 1: Introduction to Methodology and Encoding Rules

5163Citations
N/AReaders
Get full text

Effective approaches to attention-based neural machine translation

4120Citations
N/AReaders
Get full text

ρ-σ-π Analysis. A Method for the Correlation of Biological Activity and Chemical Structure

2405Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Chemformer: A pre-trained transformer for computational chemistry

174Citations
N/AReaders
Get full text

Deep learning approaches for de novo drug design: An overview

83Citations
N/AReaders
Get full text

Computer-aided multi-objective optimization in small molecule discovery

52Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

He, J., You, H., Sandström, E., Nittinger, E., Bjerrum, E. J., Tyrchan, C., … Engkvist, O. (2021). Molecular optimization by capturing chemist’s intuition using deep neural networks. Journal of Cheminformatics, 13(1). https://doi.org/10.1186/s13321-021-00497-0

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 37

49%

Researcher 35

46%

Professor / Associate Prof. 4

5%

Readers' Discipline

Tooltip

Chemistry 29

55%

Computer Science 12

23%

Pharmacology, Toxicology and Pharmaceut... 6

11%

Engineering 6

11%

Article Metrics

Tooltip
Mentions
Blog Mentions: 1
News Mentions: 2
Social Media
Shares, Likes & Comments: 41

Save time finding and organizing research with Mendeley

Sign up for free