Improving transcriptome assembly through error correction of high-throughput sequence reads

32Citations
Citations of this article
156Readers
Mendeley users who have this article in their library.

Abstract

The study of functional genomics, particularly in non-model organisms, has been dramatically improved over the last few years by the use of transcriptomes and RNAseq. While these studies are potentially extremely powerful, a computationally intensive procedure, the de novo construction of a reference transcriptome must be completed as a prerequisite to further analyses. The accurate reference is critically important as all downstream steps, including estimating transcript abundance are critically dependent on the construction of an accurate reference. Though a substantial amount of research has been done on assembly, only recently have the pre-assembly procedures been studied in detail. Specifically, several stand-alone error correction modules have been reported on and, while they have shown to be effective in reducing errors at the level of sequencing reads, how error correction impacts assembly accuracy is largely unknown. Here, we show via use of a simulated and empiric dataset, that applying error correction to sequencing reads has significant positive effects on assembly accuracy, and should be applied to all datasets. A complete collection of commands which will allow for the production of REPTILE corrected reads is available at https://github.com/macmanes/error correction/tree/master/scripts and as File S1. © 2013 MacManes and Eisen.

References Powered by Scopus

Full-length transcriptome assembly from RNA-Seq data without a reference genome

15750Citations
N/AReaders
Get full text

BLAST+: Architecture and applications

13350Citations
N/AReaders
Get full text

Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation

12222Citations
N/AReaders
Get full text

Cited by Powered by Scopus

On the optimal trimming of high-throughput mRNA sequence data

136Citations
N/AReaders
Get full text

Dynamic pigmentary and structural coloration within cephalopod chromatophore organs

125Citations
N/AReaders
Get full text

Next-generation sequencing (NGS) in the microbiological world: How to make the most of your money

107Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

MacManes, M. D., & Eisen, M. B. (2013). Improving transcriptome assembly through error correction of high-throughput sequence reads. PeerJ, 2013(1). https://doi.org/10.7717/peerj.113

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 77

56%

Researcher 47

34%

Professor / Associate Prof. 12

9%

Lecturer / Post doc 2

1%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 99

72%

Biochemistry, Genetics and Molecular Bi... 23

17%

Computer Science 13

9%

Engineering 3

2%

Article Metrics

Tooltip
Social Media
Shares, Likes & Comments: 74

Save time finding and organizing research with Mendeley

Sign up for free