Abstract
Massively parallel reporter assays (MPRAs) are powerful tools for quantifying the impacts of sequence variation on gene expression. Reading out molecular phenotypes with sequencing enables interrogating the impact of sequence variation beyond genome scale. Machine learning models integrate and codify information learned from MPRAs and enable generalization by predicting sequences outside the training data set. Models can provide a quantitative understanding of cis-regulatory codes controlling gene expression, enable variant stratification, and guide the design of synthetic regulatory elements for applications from synthetic biology to mRNA and gene therapy. This review focuses on cis-regulatory MPRAs, particularly those that interrogate cotranscriptional and post-transcriptional processes: alternative splicing, cleavage and polyadenylation, translation, and mRNA decay.
Author supplied keywords
Cite
CITATION STYLE
Fleur, A. L., Shi, Y., & Seelig, G. (2024, September 1). Decoding biology with massively parallel reporter assays and machine learning. Genes and Development. Cold Spring Harbor Laboratory Press. https://doi.org/10.1101/gad.351800.124
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.