Stochastic gradient boosting

Jerome H. Friedman

Journal Article

Computational Statistics and Data Analysis (2002) 38(4) 367-378

DOI: 10.1016/S0167-9473(01)00065-2

4.6k Citations

2.9k Readers

Abstract

Gradient boosting constructs additive regression models by sequentially fitting a simple parameterized function (base learner) to current "pseudo"-residuals by least squares at each iteration. The pseudo-residuals are the gradient of the loss functional being minimized, with respect to the model values at each training data point evaluated at the current step. It is shown that both the approximation accuracy and execution speed of gradient boosting can be substantially improved by incorporating randomization into the procedure. Specifically, at each iteration a subsample of the training data is drawn at random (without replacement) from the full training data set. This randomly selected subsample is then used in place of the full sample to fit the base learner and compute the model update for the current iteration. This randomized approach also increases robustness against overcapacity of the base learner. © 2002 Elsevier Science B.V. All rights reserved.
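
The procedure described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation, and it rests on several assumptions the abstract does not state: squared-error loss (so the pseudo-residuals reduce to `y - F(x)`, the negative gradient), one-dimensional inputs, regression stumps as the base learner, and a shrinkage factor `learning_rate`. All function and parameter names (`fit_stump`, `stochastic_gradient_boost`, `subsample`, and so on) are hypothetical.

```python
import random


def fit_stump(xs, rs):
    """Least-squares regression stump on 1-D inputs.

    Returns (threshold, left_value, right_value). If no split helps,
    the threshold is +inf and both values equal the mean of rs.
    """
    n = len(xs)
    order = sorted(range(n), key=lambda i: xs[i])
    total = sum(rs)
    best_gain = total ** 2 / n  # score of the constant (no-split) fit
    best = (float("inf"), total / n, total / n)
    left_sum = 0.0
    for k in range(1, n):
        prev_i, cur_i = order[k - 1], order[k]
        left_sum += rs[prev_i]
        if xs[prev_i] == xs[cur_i]:
            continue  # cannot split between identical x values
        right_sum = total - left_sum
        # Maximizing this quantity is equivalent to minimizing squared error.
        gain = left_sum ** 2 / k + right_sum ** 2 / (n - k)
        if gain > best_gain:
            best_gain = gain
            threshold = (xs[prev_i] + xs[cur_i]) / 2
            best = (threshold, left_sum / k, right_sum / (n - k))
    return best


def predict_stump(stump, x):
    threshold, left_value, right_value = stump
    return left_value if x <= threshold else right_value


def stochastic_gradient_boost(xs, ys, n_iter=200, subsample=0.5,
                              learning_rate=0.1, seed=0):
    """Boost stumps, fitting each one on a random subsample of the data."""
    rng = random.Random(seed)
    n = len(xs)
    f0 = sum(ys) / n  # initial constant model
    preds = [f0] * n
    stumps = []
    m = min(n, max(2, int(subsample * n)))  # subsample size
    for _ in range(n_iter):
        # Draw the subsample at random WITHOUT replacement.
        idx = rng.sample(range(n), m)
        # Assumed squared-error loss: pseudo-residual is y - F(x).
        resid = [ys[i] - preds[i] for i in idx]
        stump = fit_stump([xs[i] for i in idx], resid)
        stumps.append(stump)
        # The update is applied to every training point, not only the subsample.
        for i in range(n):
            preds[i] += learning_rate * predict_stump(stump, xs[i])
    return f0, learning_rate, stumps


def predict(model, x):
    f0, learning_rate, stumps = model
    return f0 + learning_rate * sum(predict_stump(s, x) for s in stumps)
```

With `subsample=1.0` every point is drawn at each iteration, so the sketch reduces to deterministic gradient boosting. Smaller values fit each stump on fewer points, which is the source of the speed-up the abstract mentions.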

Cite

APA

Friedman, J. H. (2002). Stochastic gradient boosting. Computational Statistics and Data Analysis, 38(4), 367–378. https://doi.org/10.1016/S0167-9473(01)00065-2
