The importance of variable selection in regression has grown in recent years as computing power has encouraged the modelling of data sets of ever-increasing size. Data mining applications in finance, marketing and bioinformatics are obvious examples. A limitation of nearly all existing variable selection methods is the need to specify the correct model before selection. When the number of predictors is large, model formulation and validation can be difficult or even infeasible. On the basis of the theory of sufficient dimension reduction, we propose a new class of model-free variable selection approaches. The methods proposed assume no model of any form, require no nonparametric smoothing and allow for general predictor effects. The efficacy of the methods proposed is demonstrated via simulation, and an empirical example is given. © 2005 Royal Statistical Society.
CITATION STYLE
Li, L., Cook, R. D., & Nachtsheim, C. J. (2005). Model-free variable selection. Journal of the Royal Statistical Society. Series B: Statistical Methodology, 67(2), 285–299. https://doi.org/10.1111/j.1467-9868.2005.00502.x
Mendeley helps you to discover research relevant for your work.