Post-processing of BLAST results using databases of clustered sequences

0Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

Motivation: When evaluating the results of a sequence similarity search, there are many situations where it can be useful to determine whether sequences appearing in the results share some distinguishing characteristic. Such dependencies between database entries are often not readily identifiable, but can yield important new insights into the biological function of a gene or protein. Results: We have developed a program called CBLAST that sorts the results of a BLAST sequence similarity search according to sequence membership in user-defined ‘clusters’ of sequences. To demonstrate the utility of this application, we have constructed two cluster databases. The first describes clusters of nucleotide sequences representing the same gene, as documented in the UNIGENE database, and the second describes clusters of protein sequences which are members of the protein families documented in the PROSITE database. Cluster databases and the CBLAST post-processor provide an efficient mechanism for identifying and exploring relationships and dependencies between new sequences and database entries. Availability: The software described in this article is available free of charge from the EBI software archive at ftp://ftp.ebi.ac.uk/pub/software/unix. © 1997 Oxford University Press.

Cite

CITATION STYLE

APA

Miller, G. S., & Fuchs, R. (1997). Post-processing of BLAST results using databases of clustered sequences. Bioinformatics, 13(1), 81–87. https://doi.org/10.1093/bioinformatics/13.1.81

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free