Classification and data mining methods are an effective way to classify data, especially in medical field, where those methods are widely used in diagnosis and analysis to make decisions. This paper presents a performance comparison between different machine learning algorithms: Support Vector Machine (SVM), Decision Tree (C4.5), Naive Bayes (NB) and k Nearest Neighbors (k-NN) applied to the Wisconsin Breast Cancer (WBC original) datasets. We use classification accuracy and confusion matrix based on 10-fold cross validation method. We also introduce a fusion at classification level between those classifiers to get the most accurate multi-classifiers approach. Experimental results show that the classification using fusion of SVM, NB and C4.5 reached the highest accuracy (97.31%) while accuracy of using a single classifier SVM is (97.13%). All experiments are executed within a simulation environment and conducted in WEKA data mining tool.
CITATION STYLE
Asri, H., Mousannif, H., & Al Moatassim, H. (2020). A hybrid data mining classifier for breast cancer prediction. In Advances in Intelligent Systems and Computing (Vol. 1103 AISC, pp. 9–16). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-36664-3_2
Mendeley helps you to discover research relevant for your work.