K-means clustering with infinite feature selection for classification tasks in gene expression data

Muhammad Akmal Remli; Kauthar Mohd Daud; Hui Wen Nies; Mohd Saberi Mohamad; Safaai Deris; Sigeru Omatu; Shahreen Kasim; Ghazali Sulong

Conference Proceedings

K-means clustering with infinite feature selection for classification tasks in gene expression data

Advances in Intelligent Systems and Computing (2017) 616 50-57

DOI: 10.1007/978-3-319-60816-7_7

10Citations

14Readers

Get full text

Abstract

In the bioinformatics and clinical research areas, microarray technology has been widely used to distinguish a cancer dataset between normal and tumour samples. However, the high dimensionality of gene expression data affects the classification accuracy of an experiment. Thus, feature selection is needed to select informative genes and remove non-informative genes. Some of the feature selection methods, yet, ignore the interaction between genes. Therefore, the similar genes are clustered together and dissimilar genes are clustered in other groups. Hence, to provide a higher classification accuracy, this research proposed k-means clustering and infinite feature selection for identifying informative genes in the selected subset. This research has been applied to colorectal cancer and small round blue cell tumors datasets. Eventually, this research successfully obtained higher classification accuracy than the previous work.

Author supplied keywords

Cite

CITATION STYLE

APA

Remli, M. A., Daud, K. M., Nies, H. W., Mohamad, M. S., Deris, S., Omatu, S., … Sulong, G. (2017). K-means clustering with infinite feature selection for classification tasks in gene expression data. In Advances in Intelligent Systems and Computing (Vol. 616, pp. 50–57). Springer Verlag. https://doi.org/10.1007/978-3-319-60816-7_7

K-means clustering with infinite feature selection for classification tasks in gene expression data

Abstract

Author supplied keywords

Cite

Register to see more suggestions