Compositional Bias of Intrinsically Disordered Proteins and Regions and Their Predictions

20Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.

Abstract

Intrinsically disordered regions (IDRs) carry out many cellular functions and vary in length and placement in protein sequences. This diversity leads to variations in the underlying compositional biases, which were demonstrated for the short vs. long IDRs. We analyze compositional biases across four classes of disorder: fully disordered proteins; short IDRs; long IDRs; and binding IDRs. We identify three distinct biases: for the fully disordered proteins, the short IDRs and the long and binding IDRs combined. We also investigate compositional bias for putative disorder produced by leading disorder predictors and find that it is similar to the bias of the native disorder. Interest-ingly, the accuracy of disorder predictions across different methods is correlated with the correct-ness of the compositional bias of their predictions highlighting the importance of the compositional bias. The predictive quality is relatively low for the disorder classes with compositional bias that is the most different from the “generic” disorder bias, while being much higher for the classes with the most similar bias. We discover that different predictors perform best across different classes of disorder. This suggests that no single predictor is universally best and motivates the development of new architectures that combine models that target specific disorder classes.

Cite

CITATION STYLE

APA

Zhao, B., & Kurgan, L. (2022). Compositional Bias of Intrinsically Disordered Proteins and Regions and Their Predictions. Biomolecules, 12(7). https://doi.org/10.3390/biom12070888

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free