Abstract
Protein databases contain a huge number of proteins of unknown function, including many whose 3D structures were newly determined by the Structural Genomics Projects. To accelerate experiment-based assignment of function, de novo prediction of protein functional sites, such as active sites in enzymes, is becoming increasingly important. Here, we attempted to improve the prediction of catalytic residues in enzyme structures by seeking and refining different encodings (i.e. residue properties) and by employing new machine learning algorithms. In particular, because catalytic residues often show distinctive network centrality when an enzyme structure is represented as a residue contact network, the corresponding measure (i.e. closeness centrality) was used as one of the most important encodings in our new predictor. A genetic algorithm integrated neural network (GANN) was also employed. With these strategies, our GANN predictor achieved a high accuracy of 91.2% in predicting catalytic residues on balanced datasets (i.e. a 1:1 ratio of catalytic to non-catalytic residues). When the GANN method was optimally applied to real enzyme structures, the active site was correctly located in 73.9% of the tested structures. The proposed GANN method also performed better than two existing methods. © The Author 2008. Published by Oxford University Press. All rights reserved.
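To illustrate the closeness centrality encoding mentioned in the abstract, the following is a minimal sketch, not the authors' implementation. It assumes one representative coordinate per residue and a distance cutoff for defining a contact; the 7.0 cutoff, the function names `contact_network` and `closeness_centrality`, and the toy coordinates are all hypothetical choices made for illustration, since the abstract does not specify how the network was built.

```python
from collections import deque
from math import dist


def contact_network(coords, cutoff=7.0):
    """Build adjacency sets; residues i and j are linked when their
    representative points lie within `cutoff` (an assumed threshold)."""
    n = len(coords)
    adj = {i: set() for i in range(n)}
    for i in range(n):
        for j in range(i + 1, n):
            if dist(coords[i], coords[j]) <= cutoff:
                adj[i].add(j)
                adj[j].add(i)
    return adj


def closeness_centrality(adj, source):
    """Closeness of `source`: number of reachable residues divided by the
    sum of shortest-path lengths to them (breadth-first search)."""
    hops = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in hops:
                hops[v] = hops[u] + 1
                queue.append(v)
    total = sum(hops.values())
    return (len(hops) - 1) / total if total > 0 else 0.0


# Toy example (made-up coordinates): five points spaced 5 units apart on a
# line, so with the assumed cutoff of 7 the network is a chain 0-1-2-3-4.
coords = [(0.0, 0.0, 0.0), (5.0, 0.0, 0.0), (10.0, 0.0, 0.0),
          (15.0, 0.0, 0.0), (20.0, 0.0, 0.0)]
adj = contact_network(coords)
scores = [closeness_centrality(adj, i) for i in range(len(coords))]
```

In this toy chain the middle point has the highest closeness, which mirrors the idea that centrally placed residues stand out in the network. In a predictor such a score would be one input feature among several, not a classifier on its own.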
Cite
Tang, Y. R., Sheng, Z. Y., Chen, Y. Z., & Zhang, Z. (2008). An improved prediction of catalytic residues in enzyme structures. Protein Engineering, Design and Selection, 21(5), 295–302. https://doi.org/10.1093/protein/gzn003