A quasi-Monte-Carlo comparison of parametric and semiparametric regression methods for heavy-tailed and non-normal data: an application to healthcare costs

Andrew M. Jones; James Lomas; Peter T. Moore; Nigel Rice

Journal ArticleOPEN ACCESS

A quasi-Monte-Carlo comparison of parametric and semiparametric regression methods for heavy-tailed and non-normal data: an application to healthcare costs

Journal of the Royal Statistical Society. Series A: Statistics in Society (2016) 179(4) 951-974

DOI: 10.1111/rssa.12141

24Citations

30Readers

Abstract

We conduct a quasi-Monte-Carlo comparison of the recent developments in parametric and semiparametric regression methods for healthcare costs, both against each other and against standard practice. The population of English National Health Service hospital in-patient episodes for the financial year 2007–2008 (summed for each patient) is randomly divided into two equally sized subpopulations to form an estimation set and a validation set. Evaluating out-of-sample using the validation set, a conditional density approximation estimator shows considerable promise in forecasting conditional means, performing best for accuracy of forecasting and among the best four for bias and goodness of fit. The best performing model for bias is linear regression with square-root-transformed dependent variables, whereas a generalized linear model with square-root link function and Poisson distribution performs best in terms of goodness of fit. Commonly used models utilizing a log-link are shown to perform badly relative to other models considered in our comparison.

Author supplied keywords

Cite

CITATION STYLE

APA

Jones, A. M., Lomas, J., Moore, P. T., & Rice, N. (2016). A quasi-Monte-Carlo comparison of parametric and semiparametric regression methods for heavy-tailed and non-normal data: an application to healthcare costs. Journal of the Royal Statistical Society. Series A: Statistics in Society, 179(4), 951–974. https://doi.org/10.1111/rssa.12141

A quasi-Monte-Carlo comparison of parametric and semiparametric regression methods for heavy-tailed and non-normal data: an application to healthcare costs

Abstract

Author supplied keywords

Cite

Register to see more suggestions