Accurate prediction of protein folding rates from sequence and sequence-derived residue flexibility and solvent accessibility

Jianzhao Gao; Tuo Zhang; Hua Zhang; Shiyi Shen; Jishou Ruan; Lukasz Kurgan

Journal ArticleOPEN ACCESS

Accurate prediction of protein folding rates from sequence and sequence-derived residue flexibility and solvent accessibility

Proteins: Structure, Function and Bioinformatics (2010) 78(9) 2114-2130

DOI: 10.1002/prot.22727

39Citations

37Readers

Get full text

Abstract

Protein folding rates vary by several orders of magnitude and they depend on the topology of the fold and the size and composition of the sequence. Although recent works show that the rates can be predicted from the sequence, allowing for high-throughput annotations, they consider only the sequence and its predicted secondary structure. We propose a novel sequence-based predictor, PFR-AF, which utilizes solvent accessibility and residue flexibility predicted from the sequence, to improve predictions and provide insights into the folding process. The predictor includes three linear regressions for proteins with two-state, multistate, and unknown (mixed-state) folding kinetics. PFR-AF on average outperforms current methods when tested on three datasets. The proposed approach provides highquality predictions in the absence of similarity between the predicted and the training sequences. The PFR-AF's predictions are characterized by high (between 0.71 and 0.95, depending on the dataset) correlation and the lowest (between 0.75 and 0.9) mean absolute errors with respect to the experimental rates, as measured using out-of-sample tests. Our models reveal that for the twostate chains inclusion of solvent-exposed Ala may accelerate the folding, while increased content of Ile may reduce the folding speed. We also demonstrate that increased flexibility of coils facilitates faster folding and that proteins with larger content of solvent-exposed strands may fold at a slower pace. The increased flexibility of the solvent-exposed residues is shown to elongate folding, which also holds, with a lower correlation, for buried residues. Two case studies are included to support our findings. © 2010 Wiley-Liss, Inc.

Author supplied keywords

Cite

CITATION STYLE

APA

Gao, J., Zhang, T., Zhang, H., Shen, S., Ruan, J., & Kurgan, L. (2010). Accurate prediction of protein folding rates from sequence and sequence-derived residue flexibility and solvent accessibility. Proteins: Structure, Function and Bioinformatics, 78(9), 2114–2130. https://doi.org/10.1002/prot.22727

Accurate prediction of protein folding rates from sequence and sequence-derived residue flexibility and solvent accessibility

Abstract

Author supplied keywords

Cite

Register to see more suggestions