Hybrid Feature Selection and Ensemble Learning Methods for Gene Selection and Cancer Classification

11Citations
Citations of this article
15Readers
Mendeley users who have this article in their library.

Abstract

A promising research field in bioinformatics and data mining is the classification of cancer based on gene expression results. Efficient sample classification is not supported by all genes. Thus, to identify the appropriate genes that help efficiently distinguish samples, a robust feature selection method is needed. Redundancy in the data on gene expression contributes to low classification performance. This paper presents the combination for gene selection and classification methods using ranking and wrapper methods. In ranking methods, information gain was used to reduce the size of dimensionality to 1% and 5%. Then, in wrapper methods K-nearest neighbors and Naïve Bayes were used with Best First, Greedy Stepwise, and Rank Search. Several combinations were investigated because it is known that no single model can give the best results using different datasets for all circumstances. Therefore, combining multiple feature selection methods and applying different classification models could provide a better decision on the final predicted cancer types. Compared with the existing classifiers, the proposed assembly gene selection methods obtained comparable performance.

Cite

CITATION STYLE

APA

Qasem, S. N., & Saeed, F. (2021). Hybrid Feature Selection and Ensemble Learning Methods for Gene Selection and Cancer Classification. International Journal of Advanced Computer Science and Applications, 12(2), 193–200. https://doi.org/10.14569/IJACSA.2021.0120225

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free