Parallel selection of informative genes for classification

1Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In this paper, we argue that existing gene selection methods are not effective for selecting important genes when the number of samples and the data dimensions grow sufficiently large. As a solution, we propose two approaches for parallel gene selections, both are based on the well known ReliefF feature selection method. In the first design, denoted by PReliefFp, the input data are split into non-overlapping subsets assigned to cluster nodes. Each node carries out gene selection by using the ReliefF method on its own subset, without interaction with other clusters. The final ranking of the genes is generated by gathering the weight vectors from all nodes. In the second design, namely PReliefFg, each node dynamically updates the global weight vectors so the gene selection results in one node can be used to boost the selection of the other nodes. Experimental results from real-world microarray expression data show that PReliefFp and PReliefFg achieve a speedup factor nearly equal to the number of nodes.

Cite

CITATION STYLE

APA

Slavik, M., Zhu, X., Mahgoub, I., & Shoaib, M. (2009). Parallel selection of informative genes for classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5462 LNBI, pp. 388–399). https://doi.org/10.1007/978-3-642-00727-9_36

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free