Frame: Detection of genomic sequencing errors

Abstract

Motivation: The underlying error rate of genomic sequencing sometimes introduces artificial frameshifts and in-frame stop codons into putative protein-encoding genes. Severe errors are then introduced into the inferred transcripts through mis-translation or premature termination.

Results: We describe a system for screening segments of DNA for frameshift and in-frame stop errors in coding regions. The method is based on homology matching, using blastx to compare all six reading frames of the query nucleotide sequence against selected protein sequence databases. Fragments of protein matching neighbouring regions of the query DNA are united and extended laterally to define candidate open reading frames, within which frameshifts and stops are identified. Suitable targets include prokaryotic or other intron-free genomic sequence and complementary DNAs. As an example of its use, we report two frameshifted ORFs that deviate from the original TIGR sequence annotations for the recently released Helicobacter pylori genome.

Availability: The tool is accessible via the URL http://www.sander.ebi.ac.uk/frame/.

Contact: brown@@@ebi.ac.uk.
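The screening idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the real tool relies on blastx for the homology search, whereas here the hits are assumed to be supplied already as simple records. The names `Hit`, `six_frame_translations`, `stop_codon_indices`, `candidate_frameshifts` and the `max_gap` threshold are hypothetical, chosen only for illustration, and the standard genetic code is assumed.

```python
from dataclasses import dataclass
from itertools import groupby

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the query uses the standard code; other codes would need another table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO))


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate complete codons only; unknown codons become 'X'."""
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3))


def six_frame_translations(seq):
    """Map frame (+1, +2, +3, -1, -2, -3) to its conceptual translation."""
    rev = reverse_complement(seq)
    frames = {}
    for offset in range(3):
        frames[offset + 1] = translate(seq[offset:])
        frames[-(offset + 1)] = translate(rev[offset:])
    return frames


def stop_codon_indices(protein):
    """Codon indices of stops ('*') within one translated frame."""
    return [i for i, aa in enumerate(protein) if aa == "*"]


@dataclass(frozen=True)
class Hit:
    """Hypothetical stand-in for one homology match against a protein."""
    subject: str   # identifier of the matched protein
    q_start: int   # match start on the query DNA
    q_end: int     # match end on the query DNA
    frame: int     # reading frame of the match (+1..+3 or -1..-3)


def candidate_frameshifts(hits, max_gap=30):
    """Flag neighbouring hits to the same protein, on the same strand,
    that lie in different frames: a possible frameshift between them."""
    def key(h):
        return (h.subject, h.frame > 0)

    found = []
    ordered = sorted(hits, key=lambda h: (key(h), h.q_start))
    for _, group in groupby(ordered, key=key):
        group = list(group)
        for left, right in zip(group, group[1:]):
            close = abs(right.q_start - left.q_end) <= max_gap
            if close and left.frame != right.frame:
                found.append((left.subject, left.q_end, right.q_start,
                              left.frame, right.frame))
    return found
```

In this sketch a frame change between two nearby matches to the same protein is only a candidate; deciding whether it reflects a sequencing error would still require inspecting the alignment, as the described system does within its candidate open reading frames.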

Cite

Brown, N. P., Sander, C., & Bork, P. (1998). Frame: Detection of genomic sequencing errors. Bioinformatics, 14(4), 367–371. https://doi.org/10.1093/bioinformatics/14.4.367
