Integration of multiple data sources to prioritize candidate genes using discounted rating system

49Citations
Citations of this article
35Readers
Mendeley users who have this article in their library.

Abstract

Background: Identifying disease gene from a list of candidate genes is an important task in bioinformatics. The main strategy is to prioritize candidate genes based on their similarity to known disease genes. Most of existing gene prioritization methods access only one genomic data source, which is noisy and incomplete. Thus, there is a need for the integration of multiple data sources containing different information.Results: In this paper, we proposed a combination strategy, called discounted rating system (DRS). We performed leave one out cross validation to compare it with N-dimensional order statistics (NDOS) used in Endeavour. Results showed that the AUC (Area Under the Curve) values achieved by DRS were comparable with NDOS on most of the disease families. But DRS worked much faster than NDOS, especially when the number of data sources increases. When there are 100 candidate genes and 20 data sources, DRS works more than 180 times faster than NDOS. In the framework of DRS, we give different weights for different data sources. The weighted DRS achieved significantly higher AUC values than NDOS.Conclusion: The proposed DRS algorithm is a powerful and effective framework for candidate gene prioritization. If weights of different data sources are proper given, the DRS algorithm will perform better. © 2010 Li and Patra; licensee BioMed Central Ltd.

References Powered by Scopus

BioGRID: a general repository for interaction datasets.

3105Citations
N/AReaders
Get full text

Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders

2293Citations
N/AReaders
Get full text

A gene-coexpression network for global discovery of conserved genetic modules

1756Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Genome-wide inferring gene-phenotype relationship by walking on the heterogeneous network

338Citations
N/AReaders
Get full text

BioGraph: Unsupervised biomedical knowledge discovery via automated hypothesis generation

101Citations
N/AReaders
Get full text

G-Protein Coupled Receptor 30 (GPR30): A Novel Regulator of Endothelial Inflammation

99Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Li, Y., & Patra, J. C. (2010). Integration of multiple data sources to prioritize candidate genes using discounted rating system. BMC Bioinformatics, 11(SUPPLL.1). https://doi.org/10.1186/1471-2105-11-S1-S20

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 20

74%

Professor / Associate Prof. 4

15%

Researcher 3

11%

Readers' Discipline

Tooltip

Agricultural and Biological Sciences 14

52%

Computer Science 10

37%

Biochemistry, Genetics and Molecular Bi... 2

7%

Chemistry 1

4%

Save time finding and organizing research with Mendeley

Sign up for free