Dimensionality reduction of protein mass spectrometry data using random projection

1Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Protein mass spectrometry (MS) pattern recognition has recently emerged as a new method for cancer diagnosis. Unfortunately, classification performance may degrade owing to the enormously high dimensionality of the data. This paper investigates the use of Random Projection in protein MS data dimensionality reduction. The effectiveness of Random Projection (RP) is analyzed and compared against Principal Component Analysis (PCA) by using three classification algorithms, namely Support Vector Machine, Feed-forward Neural Networks and K-Nearest Neighbour. Three real-world cancer data sets are employed to evaluate the performances of RP and PCA. Through the investigations, RP method demonstrated better or at least comparable classification performance as PCA if the dimensionality of the projection matrix is sufficiently large. This paper also explores the use of RP as a pre-processing step prior to PCA. The results show that without sacrificing classification accuracy, performing RP prior to PCA significantly improves the computational time. © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Chen, C. L., Weng, K. L., & Chee, P. L. (2006). Dimensionality reduction of protein mass spectrometry data using random projection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4233 LNCS-II, pp. 776–785). Springer Verlag. https://doi.org/10.1007/11893257_86

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free