A non-randomized procedure for large-scale heterogeneous multiple discrete testing based on randomized tests

Xiaoyu Dai; Nan Lin; Daofeng Li; Ting Wang

Journal Article

Biometrics (2019) 75(2) 638-649

DOI: 10.1111/biom.12996

Abstract

In the analysis of next-generation sequencing data, massive discrete data are generated from short read counts with varying biological coverage. Conducting a conditional hypothesis test such as Fisher's exact test at every genomic region of interest thus leads to a heterogeneous multiple discrete testing problem. However, most existing multiple testing procedures for controlling the false discovery rate (FDR) assume that test statistics are continuous and become conservative for discrete tests. To overcome this conservativeness, we propose a novel multiple testing procedure for better FDR control on heterogeneous discrete tests. Our procedure makes decisions based on the marginal critical function (MCF) of randomized tests, which yields a powerful, non-randomized multiple testing procedure. We provide upper bounds on the positive FDR (pFDR) and the positive false non-discovery rate (pFNR) corresponding to our procedure. We also prove that the set of detections made by our method contains every detection made by a naive application of the widely used q-value method. We further demonstrate the improvement of our method over other existing multiple testing procedures by simulations and a real example of differentially methylated region (DMR) detection using whole-genome bisulfite sequencing (WGBS) data.
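As a rough illustration of why discrete tests are conservative and what a marginal critical function looks like, the sketch below computes a one-sided Fisher's exact test p-value for a single 2x2 table, together with the textbook randomized-test construction. It is not the authors' procedure: the function names (`fisher_one_sided`, `mcf`), the one-sided alternative, and the boundary conventions are assumptions made for this example, and the abstract does not specify how MCF values are combined across tests.

```python
from math import comb  # requires Python 3.8+


def hypergeom_pmf(k, total, successes, draws):
    """P(X = k) when drawing `draws` items from `total` items containing `successes`."""
    return comb(successes, k) * comb(total - successes, draws - k) / comb(total, draws)


def fisher_one_sided(a, b, c, d):
    """One-sided (greater) Fisher's exact test on the table [[a, b], [c, d]].

    Returns (p_minus, p), where p = P(X >= a) is the conventional p-value
    and p_minus = P(X > a) is the largest attainable p-value below it.
    """
    total = a + b + c + d
    row1 = a + b   # first-row total
    col1 = a + c   # first-column total
    k_max = min(row1, col1)
    p_minus = sum(hypergeom_pmf(k, total, row1, col1) for k in range(a + 1, k_max + 1))
    p = p_minus + hypergeom_pmf(a, total, row1, col1)
    return p_minus, p


def mcf(lam, p_minus, p):
    """Assumed MCF: probability that the randomized p-value
    p_minus + U * (p - p_minus), with U ~ Uniform(0, 1), is at most lam."""
    if lam >= p:
        return 1.0
    if lam <= p_minus:
        return 0.0
    return (lam - p_minus) / (p - p_minus)


p_minus, p = fisher_one_sided(3, 1, 1, 3)
print(p_minus, p)             # attainable p-values jump from 1/70 to 17/70
print(mcf(0.05, p_minus, p))  # fractional rejection probability at threshold 0.05
```

With this table, no attainable p-value lies between 1/70 and 17/70, so a conventional test at level 0.05 behaves like a test at level 1/70; the MCF assigns a fractional value between 0 and 1 to thresholds that fall inside such a gap.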

Cite

Dai, X., Lin, N., Li, D., & Wang, T. (2019). A non-randomized procedure for large-scale heterogeneous multiple discrete testing based on randomized tests. Biometrics, 75(2), 638–649. https://doi.org/10.1111/biom.12996
