Binary classification with covariate selection through L0-penalised empirical risk minimisation

Le Yu Chen; Sokbae Lee

Journal article (open access)

Econometrics Journal (2021) 24(1) 103-120

DOI: 10.1093/ectj/utaa017

Abstract

We consider the problem of binary classification with covariate selection. We construct a classification procedure by minimising the empirical misclassification risk with a penalty on the number of selected covariates. This optimisation problem is equivalent to obtaining an ℓ0-penalised maximum score estimator. We derive probability bounds on the estimated sparsity as well as on the excess misclassification risk. These theoretical results are nonasymptotic and established in a high-dimensional setting. In particular, we show that our method yields a sparse solution whose ℓ0-norm can be arbitrarily close to true sparsity with high probability and obtain the rates of convergence for the excess misclassification risk. We implement the proposed procedure via the method of mixed-integer linear programming. Its numerical performance is illustrated in Monte Carlo experiments and a real data application of the work-trip transportation mode choice.
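
To make the objective described in the abstract concrete, the following is a minimal sketch of a penalised empirical misclassification risk: the share of misclassified observations plus a penalty proportional to the number of nonzero coefficients. It is not the authors' implementation. The function names (penalised_risk, brute_force_fit), the linear threshold rule, the toy data, the penalty level and the coefficient grid are all assumptions made for illustration, and the exhaustive grid search stands in for the mixed-integer linear programming approach only because it is easy to read at toy scale.

```python
from itertools import product


def penalised_risk(beta, X, y, lam):
    """Empirical misclassification rate plus lam times the count of nonzero coefficients."""
    errors = 0
    for x_i, y_i in zip(X, y):
        score = sum(b * x for b, x in zip(beta, x_i))
        pred = 1 if score >= 0 else 0  # assumed linear threshold classifier
        errors += int(pred != y_i)
    l0_norm = sum(1 for b in beta if b != 0)  # number of selected covariates
    return errors / len(y) + lam * l0_norm


def brute_force_fit(X, y, lam, grid=(-1.0, 0.0, 1.0)):
    """Exhaustive search over a tiny coefficient grid (illustration only, not MILP)."""
    p = len(X[0])
    return min(product(grid, repeat=p),
               key=lambda beta: penalised_risk(beta, X, y, lam))


# Hypothetical toy data: column 0 is a constant, column 1 is informative,
# column 2 is pure noise.
X = [(1.0, 2.0, 0.3), (1.0, 1.5, -0.7), (1.0, -1.0, 0.5), (1.0, -2.0, -0.2)]
y = [1, 1, 0, 0]

beta_hat = brute_force_fit(X, y, lam=0.05)
print(beta_hat)
```

On this toy data the search keeps only the informative covariate, because any additional nonzero coefficient adds to the penalty without reducing the number of classification errors. The search cost grows exponentially in the number of covariates, which is one reason a dedicated solver is needed beyond toy problems.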

Cite

Chen, L. Y., & Lee, S. (2021). Binary classification with covariate selection through L0-penalised empirical risk minimisation. Econometrics Journal, 24(1), 103–120. https://doi.org/10.1093/ectj/utaa017
