A Solution to Separation and Multicollinearity in Multiple Logistic Regression

Jianzhao Shen; Sujuan Gao

Journal ArticleOPEN ACCESS

A Solution to Separation and Multicollinearity in Multiple Logistic Regression

Shen J
Gao S

Journal of Data Science (2021) 6(4) 515-531

DOI: 10.6339/jds.2008.06(4).395

N/ACitations

69Readers

Abstract

In dementia screening tests, item selection for shortening an existing screening test can be achieved using multiple logistic regression. However, maximum likelihood estimates for such logistic regression models often experience serious bias or even non-existence because of separation and multicollinearity problems resulting from a large number of highly correlated items. Firth (1993, Biometrika, 80(1), 27-38) proposed a penalized likelihood estimator for generalized linear models and it was shown to reduce bias and the non-existence problems. The ridge regression has been used in logistic regression to stabilize the estimates in cases of multicollinearity. However, neither solves the problems for each other. In this paper, we propose a double penalized maximum likelihood estimator combining Firth's penalized likelihood equation with a ridge parameter. We present a simulation study evaluating the empirical performance of the double penalized likelihood estimator in small to moderate sample sizes. We demonstrate the proposed approach using a current screening data from a community-based dementia study.

Cite

CITATION STYLE

APA

Shen, J., & Gao, S. (2021). A Solution to Separation and Multicollinearity in Multiple Logistic Regression. Journal of Data Science, 6(4), 515–531. https://doi.org/10.6339/jds.2008.06(4).395

A Solution to Separation and Multicollinearity in Multiple Logistic Regression

Abstract

Cite

Register to see more suggestions