Sequence-specific reconstruction from fragmentary databases using seed sequences: Implementation and validation on SAGE, proteome and generic sequencing data

7Citations
Citations of this article
34Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Motivation: DNA assembly programs classically perform an all-against-all comparison of reads to identify overlaps, followed by a multiple sequence alignment and generation of a consensus sequence. If the aim is to assemble a particular segment, instead of a whole genome or transcriptome, a target-specific assembly is a more sensible approach. GenSeed is a Perl program that implements a seed-driven recursive assembly consisting of cycles comprising a similarity search, read selection and assembly. The iterative process results in a progressive extension of the original seed sequence. GenSeed was tested and validated on many applications, including the reconstruction of nuclear genes or segments, full-length transcripts, and extrachromosomal genomes. The robustness of the method was confirmed through the use of a variety of DNA and protein seeds, including short sequences derived from SAGE and proteome projects. © The Author 2008. Published by Oxford University Press. All rights reserved.

Cite

CITATION STYLE

APA

Sobreira, T. J. P., & Gruber, A. (2008). Sequence-specific reconstruction from fragmentary databases using seed sequences: Implementation and validation on SAGE, proteome and generic sequencing data. Bioinformatics, 24(15), 1676–1680. https://doi.org/10.1093/bioinformatics/btn283

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free