A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection

Ron Kohavi

Conference Proceedings

Kohavi R

IJCAI International Joint Conference on Artificial Intelligence (1995) 2 1137-1143

ISSN: 10450823

Abstract

We review accuracy estimation methods and compare the two most common methods: cross-validation and bootstrap. Recent experimental results on artificial data and theoretical results in restricted settings have shown that for selecting a good classifier from a set of classifiers (model selection), ten-fold cross-validation may be better than the more expensive leave-one-out cross-validation. We report on a large-scale experiment (over half a million runs of C4.5 and a Naive-Bayes algorithm) to estimate the effects of different parameters on these algorithms on real-world datasets. For cross-validation, we vary the number of folds and whether the folds are stratified or not; for bootstrap, we vary the number of bootstrap samples. Our results indicate that for real-world datasets similar to ours, the best method to use for model selection is ten-fold stratified cross-validation, even if computation power allows using more folds.
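As an illustration of the two resampling schemes the abstract compares, the following is a minimal sketch in plain Python. It is not code from the paper: the function names `stratified_folds` and `bootstrap_sample` are hypothetical, the round-robin assignment is just one simple way to keep class proportions balanced across folds, and no classifier (C4.5 or Naive-Bayes) is trained here.

```python
import random
from collections import defaultdict


def stratified_folds(labels, k=10, seed=0):
    """Split example indices into k folds with roughly equal class proportions.

    Hypothetical helper for illustration; not taken from the paper.
    """
    rng = random.Random(seed)

    # Group example indices by class label.
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    folds = [[] for _ in range(k)]
    offset = 0
    for indices in by_class.values():
        rng.shuffle(indices)
        # Deal each class round-robin, continuing where the previous class
        # stopped, so both fold sizes and per-class counts stay balanced.
        for j, idx in enumerate(indices):
            folds[(offset + j) % k].append(idx)
        offset += len(indices)
    return folds


def bootstrap_sample(n, seed=0):
    """Draw n indices with replacement; return them plus the indices never drawn.

    Hypothetical helper for illustration; not taken from the paper.
    """
    rng = random.Random(seed)
    train = [rng.randrange(n) for _ in range(n)]
    drawn = set(train)
    held_out = [i for i in range(n) if i not in drawn]
    return train, held_out


# Toy data (an assumption for the demo): 60 examples of class 0, 40 of class 1.
labels = [0] * 60 + [1] * 40
folds = stratified_folds(labels, k=10)
train, held_out = bootstrap_sample(len(labels))
```

In a cross-validation run, each fold would serve once as the test set while the remaining folds form the training set. In a bootstrap run, the sampled indices form the training set and the indices that were never drawn can be used for testing.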

Kohavi, R. (1995). A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2, pp. 1137–1143). International Joint Conferences on Artificial Intelligence.
