A Bayesian approach for comparing cross-validated algorithms on multiple data sets

Abstract

We present a Bayesian approach for making statistical inference about the accuracy (or any other score) of two competing algorithms that have been assessed via cross-validation on multiple data sets. The approach consists of two components. The first is a novel Bayesian correlated $$t$$ test for the analysis of the cross-validation results on a single data set, which accounts for the correlation induced by the overlapping training sets. The second merges the posterior probabilities computed by the Bayesian correlated $$t$$ test on the individual data sets, adopting a Poisson-binomial model to make inference across multiple data sets. This inference accounts for the different uncertainty of the cross-validation results on each data set; it is the first test able to do so. It is generally more powerful than the signed-rank test when ten runs of cross-validation are performed, as is generally recommended anyway.
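As a rough illustration of the two components described above, the following Python sketch is an assumption on our part rather than code from the paper: the helper names (`correlated_bayesian_ttest`, `poisson_binomial_pmf`) and the usage data are hypothetical. It estimates, on a single data set, the posterior probability that one algorithm outperforms the other via a Student posterior whose scale is inflated by the correlation of overlapping training sets, and then combines the per-data-set probabilities through a Poisson-binomial distribution.

```python
import numpy as np
from scipy import stats

def correlated_bayesian_ttest(diffs, rho):
    """Posterior probability that algorithm A outperforms B on ONE data set.

    `diffs` are the per-fold score differences (e.g. accuracy) collected over
    m runs of k-fold cross-validation; `rho` is the correlation heuristic
    n_test / (n_test + n_train), i.e. 1/k for k-fold CV.
    The posterior of the mean difference is modeled as a Student distribution
    whose scale is inflated by the correlation term rho / (1 - rho).
    """
    diffs = np.asarray(diffs, dtype=float)
    n = diffs.size
    xbar = diffs.mean()
    s2 = diffs.var(ddof=1)
    scale = np.sqrt((1.0 / n + rho / (1.0 - rho)) * s2)
    posterior = stats.t(df=n - 1, loc=xbar, scale=scale)
    return posterior.sf(0.0)  # P(mean difference > 0)

def poisson_binomial_pmf(probs):
    """PMF of the number of data sets on which A is better, given the
    per-data-set posterior probabilities `probs`, obtained by iteratively
    convolving the individual Bernoulli distributions (Poisson-binomial)."""
    pmf = np.array([1.0])
    for p in probs:
        pmf = np.convolve(pmf, [1.0 - p, p])
    return pmf

# Hypothetical usage: 10 runs of 10-fold CV (100 fold differences) on each
# of 20 synthetic data sets.
rng = np.random.default_rng(0)
datasets = [rng.normal(0.01, 0.03, size=100) for _ in range(20)]
probs = [correlated_bayesian_ttest(d, rho=1.0 / 10) for d in datasets]
pmf = poisson_binomial_pmf(probs)
wins = np.arange(len(pmf))
print("P(A better on more than half of the data sets) =",
      pmf[wins > len(probs) / 2].sum())
```

The convolution loop is a simple exact way to evaluate the Poisson-binomial distribution; for a large number of data sets a more efficient evaluation could be substituted without changing the interface.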

Citation (APA)

Corani, G., & Benavoli, A. (2015). A Bayesian approach for comparing cross-validated algorithms on multiple data sets. Machine Learning, 100(2–3), 285–304. https://doi.org/10.1007/s10994-015-5486-z
