An imputation method for estimating the learning curve in classification problems

2Citations
Citations of this article
1Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The learning curve expresses the error rate of a predictive modeling procedure, when applied to a particular population, as a function of the sample size of the training dataset. It typically is a decreasing function with a positive limiting value (bounded below by the Bayes error rate). An estimate of the learning curve can be used to assess whether a modeling procedure is expected to become substantially more accurate if additional training data were obtained. Here, we consider an imputation-based procedure for estimating learning curves. We focus on classification, although the idea is applicable to other predictive modeling settings. Simulation studies indicate that useful estimates of learning curves can be obtained for roughly a four-fold increase in the size of the training set relative to the available data, and that the proposed imputation approach outperforms an alternative estimation approach based on parameterizing the learning curve. We illustrate the method with an application that predicts the risk of disease progression for people with chronic lymphocytic leukemia.

Cite

CITATION STYLE

APA

Laber, E. B., Shedden, K., & Yang, Y. (2016). An imputation method for estimating the learning curve in classification problems. In Abel Symposia (Vol. 11, pp. 189–209). Springer Heidelberg. https://doi.org/10.1007/978-3-319-27099-9_9

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free