Unifying and generalizing methods for removing unwanted variation based on negative controls

David Gerard; Matthew Stephens

Journal ArticleOPEN ACCESS

Unifying and generalizing methods for removing unwanted variation based on negative controls

Statistica Sinica (2021) 31(3) 1145-1166

DOI: 10.5705/ss.202018.0345

3Citations

25Readers

Get full text

Abstract

Unwanted variation, including hidden confounding, is a well-known problem in many fields, but particularly in large-scale gene expression studies. Recent proposals to use control genes, genes assumed to be unassociated with the covariates of interest, have led to new methods to deal with this problem. Several versions of these removing unwanted variation (RUV) methods have been proposed, including RUV1, RUV2, RUV4, RUVinv, RUVrinv, and RUVfun. Here, we introduce a general framework, RUV*, that both unites and generalizes these approaches. This unifying framework helps clarify the connections between existing methods. In particular, we provide conditions under which RUV2 and RUV4 are equivalent. The RUV* framework preserves an advantage of the RUV approaches, namely, their modularity, which facilitates the development of novel methods based on existing matrix imputation algorithms. We illustrate this by implementing RUVB, a version of RUV* based on Bayesian factor analysis. In realistic simulations based on real data, we found RUVB to be competitive with existing methods in terms of both power and calibration. However, providing a consistently reliable calibration among the data sets remains challenging.

Author supplied keywords

Cite

CITATION STYLE

APA

Gerard, D., & Stephens, M. (2021). Unifying and generalizing methods for removing unwanted variation based on negative controls. Statistica Sinica, 31(3), 1145–1166. https://doi.org/10.5705/ss.202018.0345

Unifying and generalizing methods for removing unwanted variation based on negative controls

Abstract

Author supplied keywords

Cite

Register to see more suggestions