Using LASSO to Model Interactions and Nonlinearities in Survey Data

  • Signorino C
  • Kirchner A
N/ACitations
Citations of this article
36Readers
Mendeley users who have this article in their library.

Abstract

The LASSO and its variants have become a core part of the machine learning toolkit. Similar to OLS and logistic regression, the LASSO can be applied to continuous or binary data. The LASSO is a form of penalized regression, shrinking some coefficients exactly to zero. Because of that, it is especially useful for variable selection — for example, in situations where there are many potential covariates, only a few of which are likely relevant. In this article, we introduce the LASSO (and Adaptive LASSO) and show how it can be applied in situations where the researcher thinks the outcome variable is a nonlinear and/or interacted function of the covariates. Our motivating example is survey response. We provide an example showing how to model survey response using the LASSO and a polynomial expansion of the covariates. Our resulting model has better out-of-sample prediction for survey response than does a traditional logistic regression model. Example R code is provided in the supplemental materials.

Cite

CITATION STYLE

APA

Signorino, C. S., & Kirchner, A. (2018). Using LASSO to Model Interactions and Nonlinearities in Survey Data. Survey Practice, 11(1), 1–10. https://doi.org/10.29115/sp-2018-0005

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free