Perspectives: Sequence data base searching in the era of large-scale genomic sequencing

12Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.

Abstract

Large-scale sequencing of human and model organism genomes will have a profound impact on our ability to use sequence data base searching to predict the biochemical functions of sequences of interest. Despite the great value of more sequences in the data bases, a huge increase in data base size will also have adverse effects on data base searches. Upcoming problems will include (1) greatly increased search times, (2) an increase in background noise of high-scoring but biologically irrelevant matches, (3) inaccurate coding region prediction, leading to problems in protein data base searching, and (4) limited first-pass sequence annotation, making it difficult to determine the biological relevance of data base hits. Improved data base annotation tools and construction of smaller data bases of representative and highly-annotated sequences for first-pass analyses will be essential to deal with the impending flood of new genomic sequence.

Cite

CITATION STYLE

APA

Smith, R. F. (1996). Perspectives: Sequence data base searching in the era of large-scale genomic sequencing. Genome Research. Cold Spring Harbor Laboratory Press. https://doi.org/10.1101/gr.6.8.653

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free