Panning for genes - A visual strategy for identifying novel gene orthologs and paralogs

27Citations
Citations of this article
19Readers
Mendeley users who have this article in their library.

Abstract

We have developed a rapid visual method for identifying novel members of gene families. Starting with an evolutionary tree, 20-50 protein query sequences for a gene family are selected from different branches of the tree. These query sequences are used to search the GenBank and expressed sequence tag (EST) DNA databases and their nightly updates using the tfastx3 or tfasty3 programs. The results of all 20-50 searches are collated and resorted to highlight EST or genomic sequences that share significant similarity with the query sequences. The statistical significance of each DNA/protein alignment is plotted, highlighting the portion of the query sequence that is present in the database sequence and the percent identity in the aligned region. The collated results for database sequences are linked using the WWW to the underlying scores and alignments; these links can also be used to perform additional searches to characterize the novel sequence further. With traditional 'deep' scoring matrices (BLOSUM50) one can search for previously unrecognized families of large protein superfamilies. Alternatively, by using query sequences and EST libraries from the same species (e.g., human or mouse) together with 'shallow' scoring matrices and filters that remove high-identity sequences, one can highlight new paralogs of previously described subfamilies. Using query sequences from the glutathione transferase superfamily, we identified two novel mammalian glutathione transferase families that were recognized previously only in plants. Using query sequences from known mammalian glutathione transferase subfamilies, we identified new candidate paralogs from the mouse class-mu, class-pi, and class-theta families.

References Powered by Scopus

Basic local alignment search tool

78970Citations
N/AReaders
Get full text

Gapped BLAST and PSI-BLAST: A new generation of protein database search programs

63214Citations
N/AReaders
Get full text

CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice

58458Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Glutathione and glutathione-dependent enzymes represent a co-ordinately regulated defence against oxidative stress

1329Citations
N/AReaders
Get full text

Glutathione S-transferase polymorphisms and their biological consequences

886Citations
N/AReaders
Get full text

Identification, characterization, and crystal structure of the omega class glutathione transferases

634Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Retief, J. D., Lynch, K. R., & Pearson, W. R. (1999). Panning for genes - A visual strategy for identifying novel gene orthologs and paralogs. Genome Research, 9(4), 373–382. https://doi.org/10.1101/gr.9.4.373

Readers' Seniority

Tooltip

Professor / Associate Prof. 9

60%

PhD / Post grad / Masters / Doc 4

27%

Researcher 2

13%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 8

53%

Computer Science 3

20%

Biochemistry, Genetics and Molecular Bi... 3

20%

Social Sciences 1

7%

Save time finding and organizing research with Mendeley

Sign up for free