DNA sequence quality trimming and vector removal

Abstract

Motivation: Most sequence comparison methods assume that the data being compared are trustworthy, but this is not the case with raw DNA sequences obtained from automatic sequencing machines. Nevertheless, sequence comparisons need to be done on them in order to remove vector splice sites and contaminants. This step is necessary before other genomic data processing stages can be carried out, such as fragment assembly or EST clustering. A specialized tool is therefore needed to solve this apparent dilemma.

Results: We have designed and implemented a program that specifically addresses the problem. This program, called Lucy, has been in use since 1998 at The Institute for Genomic Research (TIGR). During this period, many rounds of experience-driven modifications were made to Lucy to improve its accuracy and its ability to deal with extremely difficult input cases. We believe we have finally obtained a useful program which strikes a delicate balance among the many issues involved in the raw sequence cleaning problem, and we wish to share it with the research community.

Citation

Chou, H. H., & Holmes, M. H. (2001). DNA sequence quality trimming and vector removal. Bioinformatics, 17(12), 1093–1104. https://doi.org/10.1093/bioinformatics/17.12.1093