A study of pre-validation

Holger Höfling; Robert Tibshirani

Journal Article, Open Access

Annals of Applied Statistics (2008) 2(2) 643-664

DOI: 10.1214/07-AOAS152

Abstract

Given a predictor of outcome derived from a high-dimensional dataset, pre-validation is a useful technique for comparing it to competing predictors on the same dataset. For microarray data, it allows one to compare a newly derived predictor for disease outcome to standard clinical predictors on the same dataset. We study pre-validation analytically to determine if the inferences drawn from it are valid. We show that while pre-validation generally works well, the straightforward "one degree of freedom" analytical test from pre-validation can be biased and we propose a permutation test to remedy this problem. In simulation studies, we show that the permutation test has the nominal level and achieves roughly the same power as the analytical test. © Institute of Mathematical Statistics.

Citation (APA)

Höfling, H., & Tibshirani, R. (2008). A study of pre-validation. Annals of Applied Statistics, 2(2), 643–664. https://doi.org/10.1214/07-AOAS152
