Methods that remove batch effects while retaining group differences may lead to exaggerated confidence in downstream analyses

Vegard Nygaard; Einar Andreas Rødland; Eivind Hovig

Journal ArticleOPEN ACCESS

Methods that remove batch effects while retaining group differences may lead to exaggerated confidence in downstream analyses

Biostatistics (2016) 17(1) 29-39

DOI: 10.1093/biostatistics/kxv027

250Citations

665Readers

Abstract

Removal of, or adjustment for, batch effects or center differences is generally required when such effects are present in data. In particular, when preparing microarray gene expression data from multiple cohorts, array platforms, or batches for later analyses, batch effects can have confounding effects, inducing spurious differences between study groups. Many methods and tools exist for removing batch effects from data. However, when study groups are not evenly distributed across batches, actual group differences may induce apparent batch differences, in which case batch adjustments may bias, usually deflate, group differences. Some tools therefore have the option of preserving the difference between study groups, e.g. using a two-way ANOVA model to simultaneously estimate both group and batch effects. Unfortunately, this approach may systematically induce incorrect group differences in downstream analyses when groups are distributed between the batches in an unbalanced manner. The scientific community seems to be largely unaware of how this approach may lead to false discoveries.

Author supplied keywords

Cite

CITATION STYLE

APA

Nygaard, V., Rødland, E. A., & Hovig, E. (2016). Methods that remove batch effects while retaining group differences may lead to exaggerated confidence in downstream analyses. Biostatistics, 17(1), 29–39. https://doi.org/10.1093/biostatistics/kxv027

Methods that remove batch effects while retaining group differences may lead to exaggerated confidence in downstream analyses

Abstract

Author supplied keywords

Cite

Register to see more suggestions