Machine learning-based analysis of the impact of 5′ untranslated region on protein expression

0Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

The 5′ untranslated region (5′UTR) plays a crucial regulatory role in messenger RNA (mRNA), with modified 5′UTRs extensively utilized in vaccine production, gene therapy, etc. Nevertheless, manually optimizing 5′UTRs may encounter difficulties in balancing the effects of various cis-elements. Consequently, multiple 5′UTR libraries have been created, and machine learning models have been employed to analyze and predict translation efficiency (TE) and protein expression, providing insights into critical regulatory features. On the one hand, these screening libraries, based on TE and mean ribosome load, struggle to accurately quantify protein expression; on the other hand, a precise method for quantifying 5′UTRs necessitates a significantly costlier library. To resolve this dilemma, we constructed a library utilizing firefly luciferase as the reporter to measure accurate protein expression. In addition, we optimized the library construction method by clustering mRNA sequences to reduce redundant data and minimize the size of the dataset. This dual strategy by increasing accuracy and reducing dataset size was found to be effective in predicting the 5′UTRs from the PC3 cell line.

Cite

CITATION STYLE

APA

Wang, L., Liu, S., Huang, J. X., Zhu, H., Li, S., Li, Y., … Zeng, J. (2025). Machine learning-based analysis of the impact of 5′ untranslated region on protein expression. Nucleic Acids Research, 53(17). https://doi.org/10.1093/nar/gkaf861

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free