MetaFam: A unified classification of protein families. II. Schema and query capabilities

Abstract

Motivation. Protein sequence and family data is accumulating at such a rapid rate that state-of-the-art databases and interface tools are required to aid curators with their classifications. We have designed such a system, MetaFam, to facilitate the comparison and integration of public protein sequence and family data. This paper presents the global schema, integration issues, and query capabilities of MetaFam.

Results. MetaFam is an integrated data warehouse of information about protein families and their sequences. This data has been collected into a consistent global schema, and stored in an Oracle relational database. The warehouse implementation allows for quick removal of outdated data sets. In addition to the relational implementation of the primary schema, we have developed several derived tables that enable efficient access from data visualization and exploration tools. Through a series of straightforward SQL queries, we demonstrate the usefulness of this data warehouse for comparing protein family classifications and for functional assignment of new sequences.
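To illustrate the kind of query the abstract refers to, the following is a minimal sketch of comparing two family classifications by counting the sequences their families share. It is not the MetaFam schema: the table names (family, family_member), the column names, the source labels, and the sample rows are all hypothetical, and an in-memory SQLite database stands in for the Oracle database named above.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a hypothetical two-table schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        -- Hypothetical tables; not the published MetaFam schema.
        CREATE TABLE family (
            family_id TEXT PRIMARY KEY,
            source_db TEXT NOT NULL      -- which classification the family comes from
        );
        CREATE TABLE family_member (
            family_id   TEXT NOT NULL REFERENCES family(family_id),
            sequence_id TEXT NOT NULL,
            PRIMARY KEY (family_id, sequence_id)
        );
        """
    )
    # Invented sample data: two classifications over five sequences.
    conn.executemany(
        "INSERT INTO family VALUES (?, ?)",
        [("A1", "source_a"), ("A2", "source_a"),
         ("B1", "source_b"), ("B2", "source_b")],
    )
    conn.executemany(
        "INSERT INTO family_member VALUES (?, ?)",
        [("A1", "seq1"), ("A1", "seq2"), ("A1", "seq3"),
         ("A2", "seq4"),
         ("B1", "seq1"), ("B1", "seq2"),
         ("B2", "seq3"), ("B2", "seq4"), ("B2", "seq5")],
    )
    return conn


def shared_members(conn, source_x, source_y):
    """Count sequences shared by each pair of families from two classifications."""
    query = """
        SELECT fx.family_id, fy.family_id, COUNT(*) AS shared
        FROM family_member AS mx
        JOIN family        AS fx ON fx.family_id   = mx.family_id
        JOIN family_member AS my ON my.sequence_id = mx.sequence_id
        JOIN family        AS fy ON fy.family_id   = my.family_id
        WHERE fx.source_db = ? AND fy.source_db = ?
        GROUP BY fx.family_id, fy.family_id
        ORDER BY fx.family_id, fy.family_id
    """
    return conn.execute(query, (source_x, source_y)).fetchall()


if __name__ == "__main__":
    conn = build_demo_db()
    for row in shared_members(conn, "source_a", "source_b"):
        print(row)
```

Under these assumptions, a pair of families with a high shared count suggests that the two classifications agree on that grouping, while a family whose members are spread over several families in the other source suggests a disagreement. Placing an unclassified sequence would need sequence-similarity data, which this sketch does not model.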

Shoop, E., Silverstein, K. A. T., Johnson, J. E., & Retzel, E. F. (2001). MetaFam: A unified classification of protein families. II. Schema and query capabilities. Bioinformatics, 17(3), 262–271. https://doi.org/10.1093/bioinformatics/17.3.262
