Nonlinear sparse component analysis with a reference: Variable selection in genomics and proteomics

Ivica Kopriva; Sanja Kapitanović; Tamara Čačev

Conference Proceedings

Nonlinear sparse component analysis with a reference: Variable selection in genomics and proteomics

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9237 168-175

DOI: 10.1007/978-3-319-22482-4_19

0Citations

2Readers

Get full text

Abstract

Many scenarios occurring in genomics and proteomics involve small number of labeled data and large number of variables. To create prediction models robust to overfitting variable selection is necessary. We propose variable selection method using nonlinear sparse component analysis with a reference representing either negative (healthy) or positive (cancer) class. Thereby, component comprised of cancer related variables is automatically inferred from the geometry of nonlinear mixture model with a reference. Proposed method is compared with 3 supervised and 2 unsupervised variable selection methods on two-class problems using 2 genomic and 2 proteomic datasets. Obtained results, which include analysis of biological relevance of selected genes, are comparable with those achieved by supervised methods. Thus, proposed method can possibly perform better on unseen data of the same cancer type.

Author supplied keywords

Cite

CITATION STYLE

APA

Kopriva, I., Kapitanović, S., & Čačev, T. (2015). Nonlinear sparse component analysis with a reference: Variable selection in genomics and proteomics. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9237, pp. 168–175). Springer Verlag. https://doi.org/10.1007/978-3-319-22482-4_19

Nonlinear sparse component analysis with a reference: Variable selection in genomics and proteomics

Abstract

Author supplied keywords

Cite

Register to see more suggestions