Glycosylphosphatidylinositol (GPI) lipid modification is an important rotein posttranslational modification found in many organisms, and GPI-anchoring is confined to the C-terminus of the target protein. We have developed a novel computational protocol for identifying GPI-anchored proteins, which is more accurate than previously proposed protocols. It uses an optimized support vector machine (SVM) classifier to recognize the C-terminal sequence pattern and uses a voting system based on SignalP version 3.0 to determine the presence or absence of the N-terminal signal of a typical GPI-anchored protein. The SVM classifier shows an accuracy of 96%, and the area under the receiver operating characteristic (ROC) curve is 0.97 under a 5-fold cross-validation test. Fourteen of 15 proteins in our sensitivity test dataset and 19 of the 20 proteins experimentally identified by Hamada et al. that were not included in the training dataset were identified correctly. This suggests that our protocol is considerably effective on unseen data. A proteomewide survey applying the protocol to S. cerevisiae identified 88 roteins as putative GPI-anchored proteins. © Springer-Verlag Berlin Heidelberg 2009.
CITATION STYLE
Cao, W., Sumikoshi, K., Terada, T., Nakamura, S., Kitamoto, K., & Shimizu, K. (2009). Computational protocol for screening gpi-anchored proteins. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5462 LNBI, pp. 164–175). https://doi.org/10.1007/978-3-642-00727-9_17
Mendeley helps you to discover research relevant for your work.