This paper proposes an unsupervised gene selection algorithm based on the singular value decomposition (SVD) to determine the most informative genes from a cancer gene expression dataset. These genes are important for many tasks including cancer clustering and classification, data compression, and samples characterization. The proposed algorithm is designed by making use of the SVD's clustering capability to find the natural groupings of the genes. The most informative genes are then determined by selecting the closest genes to the corresponding cluster's centers. These genes are then used to construct a new (pruned) dataset of the same samples but with less dimensionality. The experimental results using some standard datasets in cancer research show that the proposed algorithm can reliably improve performances of the SVD and kmeans algorithm in cancer clustering tasks. © Springer Science+Business Media Singapore 2014.
CITATION STYLE
Mirzal, A. (2014). SVD based gene selection algorithm. In Lecture Notes in Electrical Engineering (Vol. 285 LNEE, pp. 223–230). Springer Verlag. https://doi.org/10.1007/978-981-4585-18-7_26
Mendeley helps you to discover research relevant for your work.