Mitigating the Multicollinearity Problem and Its Machine Learning Approach: A Review

Jireh Yi Le Chan; Steven Mun Hong Leow; Khean Thye Bea; Wai Khuen Cheng; Seuk Wai Phoong; Zeng Wei Hong; Yen Lin Chen

ArticleOPEN ACCESS

Mitigating the Multicollinearity Problem and Its Machine Learning Approach: A Review

Mathematics

DOI: 10.3390/math10081283

499Citations

711Readers

Abstract

Technologies have driven big data collection across many fields, such as genomics and business intelligence. This results in a significant increase in variables and data points (observations) collected and stored. Although this presents opportunities to better model the relationship between predictors and the response variables, this also causes serious problems during data analysis, one of which is the multicollinearity problem. The two main approaches used to mitigate multicollinearity are variable selection methods and modified estimator methods. However, variable selection methods may negate efforts to collect more data as new data may eventually be dropped from modeling, while recent studies suggest that optimization approaches via machine learning handle data with multicollinearity better than statistical estimators. Therefore, this study details the chronological developments to mitigate the effects of multicollinearity and up-to-date recommendations to better mitigate multicollinearity.

Author supplied keywords

Cite

CITATION STYLE

APA

Chan, J. Y. L., Leow, S. M. H., Bea, K. T., Cheng, W. K., Phoong, S. W., Hong, Z. W., & Chen, Y. L. (2022, April 1). Mitigating the Multicollinearity Problem and Its Machine Learning Approach: A Review. Mathematics. MDPI. https://doi.org/10.3390/math10081283

Mitigating the Multicollinearity Problem and Its Machine Learning Approach: A Review

Abstract

Author supplied keywords

Cite

Register to see more suggestions