Identification of novel multi-transmembrane proteins from genomic databases using quasi-periodic structural properties

64Citations
Citations of this article
45Readers
Mendeley users who have this article in their library.

Abstract

Motivation: Identification of novel G protein-coupled receptors and other multi-transmembrane proteins from genomic databases using structural features. Results: Here we describe a new algorithm for identifying multi-transmembrane proteins from genomic databases with a specific application to identifying G protein-coupled receptors (GPCRs) that we call quasi-periodic feature classifier (QFC). The QFC algorithm uses concise statistical variables as the 'feature space' to characterize the quasi-periodic physico-chemical properties of multi-transmembrane proteins. For the case of identifying GPCRs, the variables are then used in a non-parametric linear discriminant function to separate GPCRs from non-GPCRs. The algorithm runs in time linearly proportional to the number of sequences, and performance on a test dataset shows 96% positive identification of known GPCRs. The QFC algorithm also works well with short random segments of proteins and it positively identified GPCRs at a level greater than 90% even with segments as short as 100 amino acids. The primary advantage of the algorithm is that it does not directly use primary sequence patterns which may be subject to sampling bias. The utility of the new algorithm has been demonstrated by the isolation from the Drosophila genome project database of a novel class of seven-transmembrane proteins which were shown to be the elusive olfactory receptor genes of Drosophila.

References Powered by Scopus

Basic local alignment search tool

78928Citations
28412Readers
Get full text
Get full text
Get full text

Cited by Powered by Scopus

Molecular evolution of the insect chemoreceptor gene superfamily in Drosophila melanogaster

630Citations
470Readers
Get full text
571Citations
340Readers
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Kim, J., Moriyama, E. N., Warr, C. G., Clyne, P. J., & Carlson, J. R. (2000). Identification of novel multi-transmembrane proteins from genomic databases using quasi-periodic structural properties. Bioinformatics, 16(9), 767–775. https://doi.org/10.1093/bioinformatics/16.9.767

Readers' Seniority

Tooltip

Researcher 20

48%

PhD / Post grad / Masters / Doc 15

36%

Professor / Associate Prof. 5

12%

Lecturer / Post doc 2

5%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 27

66%

Biochemistry, Genetics and Molecular Bi... 9

22%

Neuroscience 3

7%

Computer Science 2

5%

Save time finding and organizing research with Mendeley

Sign up for free