Post-processing of BLAST results using databases of clustered sequences

G. S. Miller; R. Fuchs

Journal ArticleOPEN ACCESS

Post-processing of BLAST results using databases of clustered sequences

Bioinformatics (1997) 13(1) 81-87

DOI: 10.1093/bioinformatics/13.1.81

0Citations

8Readers

Abstract

Motivation: When evaluating the results of a sequence similarity search, there are many situations where it can be useful to determine whether sequences appearing in the results share some distinguishing characteristic. Such dependencies between database entries are often not readily identifiable, but can yield important new insights into the biological function of a gene or protein. Results: We have developed a program called CBLAST that sorts the results of a BLAST sequence similarity search according to sequence membership in user-defined ‘clusters’ of sequences. To demonstrate the utility of this application, we have constructed two cluster databases. The first describes clusters of nucleotide sequences representing the same gene, as documented in the UNIGENE database, and the second describes clusters of protein sequences which are members of the protein families documented in the PROSITE database. Cluster databases and the CBLAST post-processor provide an efficient mechanism for identifying and exploring relationships and dependencies between new sequences and database entries. Availability: The software described in this article is available free of charge from the EBI software archive at ftp://ftp.ebi.ac.uk/pub/software/unix. © 1997 Oxford University Press.

Cite

CITATION STYLE

APA

Miller, G. S., & Fuchs, R. (1997). Post-processing of BLAST results using databases of clustered sequences. Bioinformatics, 13(1), 81–87. https://doi.org/10.1093/bioinformatics/13.1.81

Post-processing of BLAST results using databases of clustered sequences

Abstract

Cite

Register to see more suggestions