Random forest classifier combined with feature selection for breast cancer diagnosis and prognostic

Cuong Nguyen; Yong Wang; Ha Nam Nguyen

Journal ArticleOPEN ACCESS

Random forest classifier combined with feature selection for breast cancer diagnosis and prognostic

Nguyen C
Wang Y
Nguyen H

Journal of Biomedical Science and Engineering (2013) 06(05) 551-560

DOI: 10.4236/jbise.2013.65070

N/ACitations

249Readers

Abstract

As the incidence of this disease has increased significantly in the recent years, expert systems and machine learning techniques to this problem have also taken a great attention from many scholars. This study aims at diagnosing and prognosticating breast cancer with a machine learning method based on random forest classifier and feature selection technique. By weighting, keeping useful features and removing redundant features in datasets, the method was obtained to solve diagnosis problems via classifying Wisconsin Breast Cancer Diagnosis Dataset and to solve prognosis problem via classifying Wisconsin Breast Cancer Prognostic Dataset. On these datasets we obtained classification accuracy of 100% in the best case and of around 99.8% on average. This is very promising compared to the previously reported results. This result is for Wisconsin Breast Cancer Dataset but it states that this method can be used confidently for other breast cancer diagnosis problems, too.

Cite

CITATION STYLE

APA

Nguyen, C., Wang, Y., & Nguyen, H. N. (2013). Random forest classifier combined with feature selection for breast cancer diagnosis and prognostic. Journal of Biomedical Science and Engineering, 06(05), 551–560. https://doi.org/10.4236/jbise.2013.65070

Random forest classifier combined with feature selection for breast cancer diagnosis and prognostic

Abstract

Cite

Register to see more suggestions