Bagging msa learning: enhancing low-quality pssm with deep learning for accurate protein structure property prediction

Yuzhi Guo; Jiaxiang Wu; Hehuan Ma; Sheng Wang; Junzhou Huang

Conference Proceedings

Bagging msa learning: enhancing low-quality pssm with deep learning for accurate protein structure property prediction

Guo Y
Wu J
Ma H
et al.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2020) 12074 LNBI 88-103

DOI: 10.1007/978-3-030-45257-5_6

16Citations

9Readers

Get full text

Abstract

Accurate predictions of protein structure properties, e.g. secondary structure and solvent accessibility, are essential in analyzing the structure and function of a protein. PSSM (Position-Specific Scoring Matrix) features are widely used in the structure property prediction. However, some proteins may have low-quality PSSM features due to insufficient homologous sequences, leading to limited prediction accuracy. To address this limitation, we propose an enhancing scheme for PSSM features. We introduce the “Bagging MSA” method to calculate PSSM features used to train our model, and adopt a convolutional network to capture local context features and bidirectional-LSTM for long-term dependencies, and integrate them under an unsupervised framework. Structure property prediction models are then built upon such enhanced PSSM features for more accurate predictions. Empirical evaluation of CB513, CASP11, and CASP12 datasets indicate that our unsupervised enhancing scheme indeed generates more informative PSSM features for structure property prediction.

Author supplied keywords

Cite

CITATION STYLE

APA

Guo, Y., Wu, J., Ma, H., Wang, S., & Huang, J. (2020). Bagging msa learning: enhancing low-quality pssm with deep learning for accurate protein structure property prediction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12074 LNBI, pp. 88–103). Springer. https://doi.org/10.1007/978-3-030-45257-5_6

Bagging msa learning: enhancing low-quality pssm with deep learning for accurate protein structure property prediction

Abstract

Author supplied keywords

Cite

Register to see more suggestions