Deep generative models for T cell receptor protein sequences

45Citations
Citations of this article
134Readers
Mendeley users who have this article in their library.

Abstract

Probabilistic models of adaptive immune repertoire sequence distributions can be used to infer the expansion of immune cells in response to stimulus, differentiate genetic from environmental factors that determine repertoire sharing, and evaluate the suitability of various target immune sequences for stimulation via vaccination. Classically, these models are defined in terms of a probabilistic V(D)J recombination model which is sometimes combined with a selection model. In this paper we take a different approach, fitting variational autoencoder (VAE) models parameterized by deep neural networks to T cell receptor (TCR) repertoires. We show that simple VAE models can perform accurate cohort frequency estimation, learn the rules of VDJ recombination, and generalize well to unseen sequences. Further, we demonstrate that VAE-like models can distinguish between real sequences and sequences generated according to a recombination-selection model, and that many characteristics of VAE-generated sequences are similar to those of real sequences.

References Powered by Scopus

Biopython: Freely available Python tools for computational molecular biology and bioinformatics

3377Citations
N/AReaders
Get full text

Jupyter Notebooks—a publishing format for reproducible computational workflows

2714Citations
N/AReaders
Get full text

The mechanism and regulation of chromosomal V(D)J recombination

747Citations
N/AReaders
Get full text

Cited by Powered by Scopus

De novo protein design by deep network hallucination

320Citations
N/AReaders
Get full text

An evolution-based model for designing chorismate mutase enzymes

171Citations
N/AReaders
Get full text

Deep Learning in Protein Structural Modeling and Design

147Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Davidsen, K., Olson, B. J., DeWitt, W. S., Feng, J., Harkins, E., Bradley, P., & Matsen, F. A. (2019). Deep generative models for T cell receptor protein sequences. ELife, 8. https://doi.org/10.7554/eLife.46935

Readers over time

‘19‘20‘21‘22‘23‘24015304560

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 40

55%

Researcher 31

42%

Professor / Associate Prof. 2

3%

Readers' Discipline

Tooltip

Biochemistry, Genetics and Molecular Bi... 27

39%

Agricultural and Biological Sciences 21

30%

Immunology and Microbiology 13

19%

Computer Science 8

12%

Save time finding and organizing research with Mendeley

Sign up for free
0