A boosting‐based machine learning algorithm is presented to model a binary response with large imbalance, i.e., a rare event. The new method (i) reduces the prediction error of the rare class, and (ii) approximates an econometric model that allows interpretability. RiskLogitboost regression includes a weighting mechanism that oversamples or undersamples observations according to their misclassification likelihood and a generalized least squares bias correction strategy to reduce the prediction error. An illustration using a real French third‐party liability motor insurance data set is presented. The results show that RiskLogitboost regression improves the rate of detection of rare events compared to some boosting‐based and tree‐based algorithms and some existing methods designed to treat imbalanced responses.
CITATION STYLE
Pesantez‐narvaez, J., Guillen, M., & Alcañiz, M. (2021). Risklogitboost regression for rare events in binary response: An econometric approach. Mathematics, 9(5). https://doi.org/10.3390/math9050579
Mendeley helps you to discover research relevant for your work.