genBlastA: Enabling BLAST to identify homologous gene sequences

Rong She; Jeffrey S.C. Chu; Ke Wang; Jian Pei; Nansheng Chen

Journal ArticleOPEN ACCESS

genBlastA: Enabling BLAST to identify homologous gene sequences

Genome Research (2009) 19(1) 143-149

DOI: 10.1101/gr.082081.108

224Citations

176Readers

Abstract

BLAST is an extensively used local similarity search tool for identifying homologous sequences. When a gene sequence (either protein sequence or nucleotide sequence) is used as a query to search for homologous sequences in a genome, the search results, represented as a list of high-scoring pairs (HSPs), are fragments of candidate genes rather than full-length candidate genes. Relevant HSPs ("signals"), which represent candidate genes in the target genome sequences, are buried within a report that contains also hundreds to thousands of random HSPs ("noises"). Consequently, BLAST results are often overwhelming and confusing even to experienced users. For effective use of BLAST, a program is needed for extracting relevant HSPs that represent candidate homologous genes from the entire HSP report. To achieve this goal, we have designed a graph-based algorithm, genBlastA, which automatically filters HSPs into well-defined groups, each representing a candidate gene in the target genome. The novelty of genBlastA is an edge length metric that reflects a set of biologically motivated requirements so that each shortest path corresponds to an HSP group representing a homologous gene. We have demonstrated that this novel algorithm is both efficient and accurate for identifying homologous sequences, and that it outperforms existing approaches with similar functionalities. © 2009 by Cold Spring Harbor Laboratory Press.

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

She, R., Chu, J. S. C., Wang, K., Pei, J., & Chen, N. (2009). genBlastA: Enabling BLAST to identify homologous gene sequences. Genome Research, 19(1), 143–149. https://doi.org/10.1101/gr.082081.108

Readers' Seniority

PhD / Post grad / Masters / Doc 83

59%

Researcher 37

26%

Professor / Associate Prof. 19

13%

Lecturer / Post doc 2

Readers' Discipline

Agricultural and Biological Sciences 105

75%

Biochemistry, Genetics and Molecular Bi... 23

16%

Computer Science 8

Engineering 4

Article Metrics

Mentions

News Mentions: 1

Social Media

Shares, Likes & Comments: 4

View details >

genBlastA: Enabling BLAST to identify homologous gene sequences

Abstract

References Powered by Scopus

Basic local alignment search tool

Initial sequencing and analysis of the human genome

The sequence of the human genome

Cited by Powered by Scopus

Gossypium barbadense and Gossypium hirsutum genomes provide insights into the origin and evolution of allotetraploid cotton

The monarch butterfly genome yields insights into long-distance migration

Population genomics reveal recent speciation and rapid evolutionary adaptation in polar bears

Register to see more suggestions

Cite

Readers' Seniority

Readers' Discipline

Article Metrics