Genome cluster database. A sequence family analysis platform for arabidopsis and rice

31Citations
Citations of this article
43Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The genome-wide protein sequences from Arabidopsis (Arabidopsis thaliana) and rice (Oryza sativa) spp. japonica were clustered into families using sequence similarity and domain-based clustering. The two fundamentally different methods resulted in separate cluster sets with complementary properties to compensate the limitations for accurate family analysis. Functional names for the identified families were assigned with an efficient computational approach that uses the description of the most common molecular function gene ontology node within each cluster. Subsequently, multiple alignments and phylogenetic trees were calculated for the assembled families. All clustering results and their underlying sequences were organized in the Web-accessible Genome Cluster Database (http://bioinfo.ucr.edu/projects/GCD) with rich interactive and user-friendly sequence family mining tools to facilitate the analysis of any given family of interest for the plant science community. An automated clustering pipeline ensures current information for future updates in the annotations of the two genomes and clustering improvements. The analysis allowed the first systematic identification of family and singlet proteins present in both organisms as well as those restricted to one of them. In addition, the established Web resources for mining these data provide a road map for future studies of the composition and structure of protein families between the two species. © 2005 American Society of Plant Biologists.

References Powered by Scopus

Basic local alignment search tool

81778Citations
N/AReaders
Get full text

Gapped BLAST and PSI-BLAST: A new generation of protein database search programs

64346Citations
N/AReaders
Get full text

Gene ontology: Tool for the unification of biology

34050Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Patterns of gene duplication in the plant SKP1 gene family in angiosperms: Evidence for multiple mechanisms of rapid gene birth

400Citations
N/AReaders
Get full text

The acyltransferase GPAT5 is required for the synthesis of suberin in seed coat and root of Arabidopsis

357Citations
N/AReaders
Get full text

PLAZA: A comparative genomics resource to study gene and genome evolution in plants

248Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Horan, K., Lauricha, J., Bailey-Serres, J., Raikhel, N., & Girke, T. (2005). Genome cluster database. A sequence family analysis platform for arabidopsis and rice. Plant Physiology, 138(1), 47–54. https://doi.org/10.1104/pp.104.059048

Readers over time

‘10‘11‘12‘13‘14‘15‘16‘17‘18‘19‘20‘21‘22‘23‘2402468

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 19

54%

Researcher 11

31%

Professor / Associate Prof. 5

14%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 33

89%

Biochemistry, Genetics and Molecular Bi... 2

5%

Computer Science 1

3%

Earth and Planetary Sciences 1

3%

Article Metrics

Tooltip
Mentions
News Mentions: 1

Save time finding and organizing research with Mendeley

Sign up for free
0