Optimization of miRNA-seq data preprocessing

110Citations
Citations of this article
434Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

The past two decades ofmicroRNA (miRNA) research has solidified the role of these small non-coding RNAs as key regula- tors ofmany biological processes and promising biomarkers for disease. The concurrent development in high-throughput profiling technology has further advanced our understanding of the impact of their dysregulation on a global scale. Currently, next-generation sequencing is the platformof choice for the discovery and quantification ofmiRNAs. Despite this, there is no clear consensus on how the data should be preprocessed before conducting downstreamanalyses. Often overlooked, data preprocessing is an essential step in data analysis: the presence of unreliable features and noise can affect the conclusions drawn fromdownstreamanalyses. Using a spike-in dilution study, we evaluated the effects of several gen- eral-purpose aligners (BWA, Bowtie, Bowtie 2 and Novoalign), and normalization methods (counts-per-million, total count scaling, upper quartile scaling, Trimmed Mean of M, DESeq, linear regression, cyclic loess and quantile) with respect to the final miRNA count data distribution, variance, bias and accuracy of differential expression analysis. Wemake practical recommendations on the optimal preprocessingmethods for the extraction and interpretation of miRNA count data from small RNA-sequencing experiments.

Cite

CITATION STYLE

APA

Tam, S., Tsao, M. S., & McPherson, J. D. (2015). Optimization of miRNA-seq data preprocessing. Briefings in Bioinformatics, 16(6), 950–963. https://doi.org/10.1093/bib/bbv019

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free