How Deep Learning Tools Can Help Protein Engineers Find Good Sequences

Abstract

The deep learning revolution introduced a new and efficacious way to address computational challenges in a wide range of fields, relying on large data sets and powerful computational resources. In protein engineering, we consider the challenge of computationally predicting properties of a protein and designing sequences with these properties. Indeed, accurate and fast deep network oracles for different properties of proteins have been developed. These learn to predict a property from an amino acid sequence by training on large sets of proteins that have this property. In particular, deep networks can learn from the set of all known protein sequences to identify ones that are protein-like. A fundamental challenge when engineering sequences that are both protein-like and satisfy a desired property is that such sequences are rare within the vast space of all possible sequences. When searching for these very rare instances, one would like to use good sampling procedures. Sampling approaches that are decoupled from the prediction of the property, or in which the predictor is used only after sampling to identify good instances, are less efficient. The alternative is to use sampling methods that are geared to generate sequences that satisfy and/or optimize the desired properties as judged by the predictor. Deep learning has a class of architectures, denoted as generative models, which offer the capability of sampling from the learned distribution of a predicted property. Here, we review the use of deep learning tools to find good sequences for protein engineering, including developing oracles/predictors of a property of the proteins and methods that sample from a distribution of protein-like sequences to optimize the desired property.
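The contrast the abstract draws between decoupled and predictor-guided sampling can be illustrated with a toy sketch. This is not the authors' method and uses no deep network: `toy_oracle` is a hypothetical stand-in for a trained property predictor (it just counts matches to a hidden target sequence), and the guided sampler is a plain greedy point-mutation search swapped in for a generative model. All function names, the sequence length, and the budget of oracle calls are assumptions made for illustration.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def toy_oracle(seq, target):
    """Hypothetical stand-in for a trained predictor:
    fraction of positions that match a hidden target sequence."""
    return sum(a == b for a, b in zip(seq, target)) / len(target)


def sample_then_filter(oracle, length, budget, rng):
    """Decoupled sampling: draw sequences uniformly at random,
    score each one afterwards, and keep the best."""
    best_seq, best_score = None, -1.0
    for _ in range(budget):
        seq = "".join(rng.choice(AMINO_ACIDS) for _ in range(length))
        score = oracle(seq)
        if score > best_score:
            best_seq, best_score = seq, score
    return best_seq, best_score


def oracle_guided_search(oracle, length, budget, rng):
    """Guided sampling (greedy local search, an assumed simplification):
    propose point mutations and keep those the oracle does not score lower."""
    seq = [rng.choice(AMINO_ACIDS) for _ in range(length)]
    score = oracle("".join(seq))
    for _ in range(budget - 1):  # one oracle call was spent on the start
        pos = rng.randrange(length)
        old = seq[pos]
        seq[pos] = rng.choice(AMINO_ACIDS)
        new_score = oracle("".join(seq))
        if new_score >= score:
            score = new_score
        else:
            seq[pos] = old  # revert a mutation that lowered the score
    return "".join(seq), score


def compare(seed=0, length=30, budget=2000):
    """Run both strategies with the same number of oracle calls."""
    rng = random.Random(seed)
    target = "".join(rng.choice(AMINO_ACIDS) for _ in range(length))

    def oracle(seq):
        return toy_oracle(seq, target)

    _, filtered = sample_then_filter(oracle, length, budget, rng)
    _, guided = oracle_guided_search(oracle, length, budget, rng)
    return filtered, guided


if __name__ == "__main__":
    filtered, guided = compare()
    print(f"sample-then-filter best score: {filtered:.2f}")
    print(f"oracle-guided best score:      {guided:.2f}")
```

With the same budget of oracle calls, the guided search should typically reach a much higher score than filtering uniform samples, because good sequences are too rare to be hit by chance. A real pipeline would replace the toy oracle with a trained predictor and would also need to constrain proposals to protein-like sequences, which this sketch does not attempt.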

Osadchy, M., & Kolodny, R. (2021). How Deep Learning Tools Can Help Protein Engineers Find Good Sequences. Journal of Physical Chemistry B, 125(24), 6440–6450. https://doi.org/10.1021/acs.jpcb.1c02449
