Typically, microarray gene expression data obscure imperative information which is necessary for the understanding of molecular biology processes that occurs in a specific organism with respect to its environment. Uncovering gene expression data’s invisible patterns will lead to a remarkable desire to enhance the interpretation of functional genomics. Biological networks intricacy and the presence of huge amount of genes raise the difficulties of understanding of the high dimension data, which resides lots of measurements. Thus, clustering techniques which are crucial in the data mining process are used as the first step to address this challenge to discover logical structures and predict significant patterns in the hidden data. These patterns may offer shreds of evidence about the biological process related to different physiological conditions. On deep, this paper focuses on the co-clustering algorithm to cluster genes and conditions simultaneously to obtain co-clusters further utilised for classification. The method called an improved network assisted co-clustering for the identification of cancer subtypes (iNCIS). Fundamentally, it integrates gene network information with gene expression to achieve biologically significant clusters. The classes obtained from clusters were used in the classification of genes to improve accuracy. This method applied to breast cancer and glioblastoma multiforme datasets. The discovered structures disclosed strong biological significance associations between functional annotations of genes with related conditions.
CITATION STYLE
Machap, L., Abdullah, A., & Shah, Z. A. (2019). Classification of cancer gene subtypes from clustering of gene expression data. International Journal of Innovative Technology and Exploring Engineering, 8(7), 332–336.
Mendeley helps you to discover research relevant for your work.