Classification methods are fundamental techniques for finding mathematical models that assign each object to its proper class on the basis of a set of measurements. When the number of variables in an experiment is large, classifying objects into groups becomes prone to misclassification. This study explores approaches for tackling the classification problem with a large number of independent variables using two parametric methods: partial least squares discriminant analysis (PLS-DA) and principal component analysis followed by linear discriminant analysis (PCA+LDA). Data were generated with a data simulator, Azure Machine Learning (AML) Studio, through a custom R module. The performance of PLS-DA was analysed and compared with that of the PCA+LDA model for different numbers of variables (p) and different sample sizes (n), using the minimum misclassification rate as the evaluation criterion. The results demonstrated that PLS-DA performed better than PCA+LDA for large sample sizes. PLS-DA can therefore be considered a good and reliable technique for classification tasks on large datasets.
CITATION STYLE
Rashid, N. A., Hussain, W. S. E. C., Ahmad, A. R., & Abdullah, F. N. (2019). Performance of classification analysis: A comparative study between PLS-DA and integrating PCA+LDA. Mathematics and Statistics, 7(4), 24–28. https://doi.org/10.13189/ms.2019.070704