A novel method for accurate one-dimensional protein structure prediction based on fragment matching

23Citations
Citations of this article
33Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Motivation: The precise prediction of one-dimensional (1D) protein structure as represented by the protein secondary structure and 1D string of discrete state of dihedral angles (i.e. Shape Strings) is a prerequisite for the successful prediction of three-dimensional (3D) structure as well as protein-protein interaction. We have developed a novel 1D structure prediction method, called Frag1D, based on a straightforward fragment matching algorithm and demonstrated its success in the prediction of three sets of 1D structural alphabets, i.e. the classical three-state secondary structure, three- and eight-state Shape Strings. Results: By exploiting the vast protein sequence and protein structure data available, we have brought secondary-structure prediction closer to the expected theoretical limit. When tested by a leave-one-out cross validation on a non-redundant set of PDB cutting at 30% sequence identity containing 5860 protein chains, the overall per-residue accuracy for secondary-structure prediction, i.e. Q3 is 82.9%. The overall per-residue accuracy for three- and eight-state Shape Strings are 85.1 and 71.5%, respectively. We have also benchmarked our program with the latest version of PSIPRED for secondary structure prediction and our program predicted 0.3% better in Q3 when tested on 2241 chains with the same training set. For Shape Strings, we compared our method with a recently published method with the same dataset and definition as used by that method. Our program predicted at 2.2% better in accuracy for three-state Shape Strings. By quantitatively investigating the effect of data base size on 1D structure prediction we show that the accuracy increases by ~1% with every doubling of the database size. Availability: The program is available for download at http://www.fos.su.se/~nanjiang/Frag1D/download. Supplementary data are available at http://www.fos.su.se/~nanjiang/Frag1D/supplement/suppl.html. Contact: svenh@struc.su.se Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2009. Published by Oxford University Press.

References Powered by Scopus

Gapped BLAST and PSI-BLAST: A new generation of protein database search programs

63174Citations
N/AReaders
Get full text

Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features

13418Citations
N/AReaders
Get full text

Protein secondary structure prediction based on position-specific scoring matrices

4722Citations
N/AReaders
Get full text

Cited by Powered by Scopus

PreDNA: Accurate prediction of DNA-binding sites in proteins by integrating sequence and geometric structure information

34Citations
N/AReaders
Get full text

A novel structural position-specific scoring matrix for the prediction of protein secondary structures

33Citations
N/AReaders
Get full text

Structural protein descriptors in 1-dimension and their sequence-based predictions

31Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Zhou, T., Shu, N., & Hovmöller, S. (2009). A novel method for accurate one-dimensional protein structure prediction based on fragment matching. Bioinformatics, 26(4), 470–477. https://doi.org/10.1093/bioinformatics/btp679

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 20

80%

Researcher 4

16%

Lecturer / Post doc 1

4%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 19

70%

Biochemistry, Genetics and Molecular Bi... 4

15%

Computer Science 3

11%

Earth and Planetary Sciences 1

4%

Save time finding and organizing research with Mendeley

Sign up for free