Nonlinear sparse component analysis with a reference: Variable selection in genomics and proteomics

0Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Many scenarios occurring in genomics and proteomics involve small number of labeled data and large number of variables. To create prediction models robust to overfitting variable selection is necessary. We propose variable selection method using nonlinear sparse component analysis with a reference representing either negative (healthy) or positive (cancer) class. Thereby, component comprised of cancer related variables is automatically inferred from the geometry of nonlinear mixture model with a reference. Proposed method is compared with 3 supervised and 2 unsupervised variable selection methods on two-class problems using 2 genomic and 2 proteomic datasets. Obtained results, which include analysis of biological relevance of selected genes, are comparable with those achieved by supervised methods. Thus, proposed method can possibly perform better on unseen data of the same cancer type.

Cite

CITATION STYLE

APA

Kopriva, I., Kapitanović, S., & Čačev, T. (2015). Nonlinear sparse component analysis with a reference: Variable selection in genomics and proteomics. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9237, pp. 168–175). Springer Verlag. https://doi.org/10.1007/978-3-319-22482-4_19

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free