PeakSelect: Preprocessing tandem mass spectra for better peptide identification

22Citations
Citations of this article
21Readers
Mendeley users who have this article in their library.

Abstract

We present a new preprocessing method, PeakSelect, to improve the accuracy and efficiency of Tandem Mass-Spec peptide (protein) identification. The fundamental difference between noise and fragment ions in spectra is that ions have isotopes but noise does not. We propose a new and important concept of an Isotope Pattern Vector (IPV) which characterizes the isotope cluster of fragment ions. Then the noise and real peaks can be distinguished by the quantitative IPV values. PeakSelect first uses a new method of the Gaussian Mixture Model and Expectation-Maximization (EM) algorithm to find the base intensity level (baseline) in a spectrum. Then PeakSelect selects features based on the IPV and baseline, and constructs a decision tree to automatically classify the peaks into different categories such as noise, single ion peaks, and overlapping peaks. Experiments show that PeakSelect can help to reduce the Mascot searching time and increase the reliability of peptide identifications. In particular, PeakSelect performs well on complex spectra with a large number of peaks from large peptides, and supports more sequence identification than other well-known systems. Copyright © 2008 John Wiley & Sons, Ltd.

Cite

CITATION STYLE

APA

Zhang, J., He, S., Ling, C. X., Cao, X., Zeng, R., & Gao, W. (2008). PeakSelect: Preprocessing tandem mass spectra for better peptide identification. Rapid Communications in Mass Spectrometry, 22(8), 1203–1212. https://doi.org/10.1002/rcm.3488

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free