An imputation method for estimating the learning curve in classification problems

Eric B. Laber; Kerby Shedden; Yang Yang

Conference Proceedings

An imputation method for estimating the learning curve in classification problems

Abel Symposia (2016) 11 189-209

DOI: 10.1007/978-3-319-27099-9_9

2Citations

1Readers

Get full text

Abstract

The learning curve expresses the error rate of a predictive modeling procedure, when applied to a particular population, as a function of the sample size of the training dataset. It typically is a decreasing function with a positive limiting value (bounded below by the Bayes error rate). An estimate of the learning curve can be used to assess whether a modeling procedure is expected to become substantially more accurate if additional training data were obtained. Here, we consider an imputation-based procedure for estimating learning curves. We focus on classification, although the idea is applicable to other predictive modeling settings. Simulation studies indicate that useful estimates of learning curves can be obtained for roughly a four-fold increase in the size of the training set relative to the available data, and that the proposed imputation approach outperforms an alternative estimation approach based on parameterizing the learning curve. We illustrate the method with an application that predicts the risk of disease progression for people with chronic lymphocytic leukemia.

Cite

CITATION STYLE

APA

Laber, E. B., Shedden, K., & Yang, Y. (2016). An imputation method for estimating the learning curve in classification problems. In Abel Symposia (Vol. 11, pp. 189–209). Springer Heidelberg. https://doi.org/10.1007/978-3-319-27099-9_9

An imputation method for estimating the learning curve in classification problems

Abstract

Cite

Register to see more suggestions