A classification framework applied to cancer gene expression profiles

Hussein Hijazi; Christina Chan

Journal ArticleOPEN ACCESS

A classification framework applied to cancer gene expression profiles

Journal of Healthcare Engineering (2013) 4(2) 255-283

DOI: 10.1260/2040-2295.4.2.255

52Citations

70Readers

Abstract

Classification of cancer based on gene expression has provided insight into possible treatment strategies. Thus, developing machine learning methods that can successfully distinguish among cancer subtypes or normal versus cancer samples is important. This work discusses supervised learning techniques that have been employed to classify cancers. Furthermore, a two-step feature selection method based on an attribute estimation method (e.g., ReliefF) and a genetic algorithm was employed to find a set of genes that can best differentiate between cancer subtypes or normal versus cancer samples. The application of different classification methods (e.g., decision tree, k-nearest neighbor, support vector machine (SVM), bagging, and random forest) on 5 cancer datasets shows that no classification method universally outperforms all the others. However, k-nearest neighbor and linear SVM generally improve the classification performance over other classifiers. Finally, incorporating diverse types of genomic data (e.g., protein-protein interaction data and gene expression) increase the prediction accuracy as compared to using gene expression alone.

Author supplied keywords

Cite

CITATION STYLE

APA

Hijazi, H., & Chan, C. (2013). A classification framework applied to cancer gene expression profiles. Journal of Healthcare Engineering, 4(2), 255–283. https://doi.org/10.1260/2040-2295.4.2.255

A classification framework applied to cancer gene expression profiles

Abstract

Author supplied keywords

Cite

Register to see more suggestions