Clustering-based feature selection framework for microarray data

Smita Chormunge; Sudarson Jena

Journal ArticleOPEN ACCESS

Clustering-based feature selection framework for microarray data

International Journal of Performability Engineering (2017) 13(4) 383-389

DOI: 10.23940/ijpe.17.04.p5.383389

1Citations

8Readers

Abstract

Gene's expression data contains hundreds to thousands of features. It is challenging for machine learning algorithms to find the relevant information from such huge and correlated data. Irrelevant and redundant features are computationally costly and decrease the accuracy of machine learning algorithms. Feature selection plays important role to solve the problem of dimensionality. But most of the traditional feature selection algorithms fail to scale on high dimensionality problems. In this paper Clustering based Feature Selection Framework named as (CFSF) is proposed. CFSF produces optimal feature subset by eliminating irrelevant features using clustering algorithm and redundant features by applying filter measure on each cluster. Extensive experiments are carried out to compare proposed framework and other representative methods with respect to two classifiers namely Naive Bayes and Instance Based on microarray datasets. The empirical study demonstrates that the proposed framework is very efficient and effective for producing optimal feature subset and improves classifier performance.

Author supplied keywords

Cite

CITATION STYLE

APA

Chormunge, S., & Jena, S. (2017). Clustering-based feature selection framework for microarray data. International Journal of Performability Engineering, 13(4), 383–389. https://doi.org/10.23940/ijpe.17.04.p5.383389

Clustering-based feature selection framework for microarray data

Abstract

Author supplied keywords

Cite

Register to see more suggestions