Finding consistent patterns: A nonparametric approach for identifying differential expression in RNA-Seq data

Jun Li; Robert Tibshirani

Journal Article

Finding consistent patterns: A nonparametric approach for identifying differential expression in RNA-Seq data

Statistical Methods in Medical Research (2013) 22(5) 519-536

DOI: 10.1177/0962280211428386

296Citations

637Readers

Get full text

Abstract

We discuss the identification of features that are associated with an outcome in RNA-Sequencing (RNA-Seq) and other sequencing-based comparative genomic experiments. RNA-Seq data takes the form of counts, so models based on the normal distribution are generally unsuitable. The problem is especially challenging because different sequencing experiments may generate quite different total numbers of reads, or 'sequencing depths'. Existing methods for this problem are based on Poisson or negative binomial models: they are useful but can be heavily influenced by 'outliers' in the data. We introduce a simple, non-parametric method with resampling to account for the different sequencing depths. The new method is more robust than parametric methods. It can be applied to data with quantitative, survival, two-class or multiple-class outcomes. We compare our proposed method to Poisson and negative binomial-based methods in simulated and real data sets, and find that our method discovers more consistent patterns than competing methods. © The Authors 2011 Reprints and permissions: sagepub.co.uk/journalsPermissions.nav.

Author supplied keywords

Cite

CITATION STYLE

APA

Li, J., & Tibshirani, R. (2013). Finding consistent patterns: A nonparametric approach for identifying differential expression in RNA-Seq data. Statistical Methods in Medical Research, 22(5), 519–536. https://doi.org/10.1177/0962280211428386

Finding consistent patterns: A nonparametric approach for identifying differential expression in RNA-Seq data

Abstract

Author supplied keywords

Cite

Register to see more suggestions