Classification studies are widely applied in many areas of research. In our study, we are using classification analysis to explore approaches for tackling the classification problem for a large number of measures using partial least square discriminant analysis (PLS-DA) and decision trees (DT). The performance for both methods was compared using a sample data of breast tissues from the University of Wisconsin Hospital. A partial least square discriminant analysis (PLS-DA) and decision trees (DT) predict the diagnosis of breast tissues (M = malignant, B = benign). A total of 699 patients diagnose (458 benign and 241 malignant) are used in this study. The performance of PLS-DA and DT has been evaluated based on the misclassification error and accuracy rate. The results show PLS-DA can be considered as a good and reliable technique to be used when dealing with a large dataset for the classification task and have good prediction accuracy.
CITATION STYLE
Rashid, N. A., Nasaruddin, N., Kassim, K., & Rahim, A. H. A. (2020). Comparison analysis: Large data classification using PLS-DA and decision trees. Mathematics and Statistics, 8(2), 100–105. https://doi.org/10.13189/ms.2020.080205
Mendeley helps you to discover research relevant for your work.