Brain cancer prediction based on novel interpretable ensemble gene selection algorithm and classifier

3Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.

Abstract

The growth of abnormal cells in the brain causes human brain tumors. Identifying the type of tumor is crucial for the prognosis and treatment of the patient. Data from cancer microarrays typically include fewer samples with many gene expression levels as features, reflecting the curse of dimensionality and making classifying data from microarrays challenging. In most of the examined studies, cancer classification (Malignant and benign) accuracy was examined without disclosing biological information related to the classification process. A new approach was proposed to bridge the gap between cancer classification and the interpretation of the biological studies of the genes implicated in cancer. This study aims to develop a new hybrid model for cancer classification (by using feature selection mRMRe as a key step to improve the performance of classification methods and a distributed hyperparameter optimization for gradient boosting ensemble methods). To evaluate the proposed method, NB, RF, and SVM classifiers have been chosen. In terms of the AUC, sensitivity, and specificity, the optimized CatBoost classifier performed better than the optimized XGBoost in cross-validation 5, 6, 8, and 10. With an accuracy of 0.91 ± 0.12, the optimized CatBoost classifier is more accurate than the CatBoost classifier without optimization, which is 0.81 ± 0.24. By using hybrid algorithms, SVM, RF, and NB automatically become more accurate. Furthermore, in terms of accuracy, SVM and RF (0.97 ± 0.08) achieve equivalent and higher classification accuracy than NB (0.91 ± 0.12). The findings of relevant biomedical studies confirm the findings of the selected genes.

Cite

CITATION STYLE

APA

Almars, A. M., Alwateer, M., Qaraad, M., Amjad, S., Fathi, H., Kelany, A. K., … Elhosseini, M. (2021). Brain cancer prediction based on novel interpretable ensemble gene selection algorithm and classifier. Diagnostics, 11(10). https://doi.org/10.3390/diagnostics11101936

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free