Malaria is one of the world's most deadly diseases and is caused by the parasite Plasmodium falciparum. Sixty percent of P. falciparum genes have no known function and therefore new methods of gene function prediction are needed. To address this problem, we train a naïve Bayes classifier on multiple sources of data and subsequently apply a modified version of the Gene Set Enrichment Analysis Algorithm to predict gene function in P. falciparum. To define gene function, we exploit the hierarchical structure of the Gene Ontology, specifically using the Biological Process category. We demonstrate the value of integrating multiple data sources by achieving accurate predictions on genes that cannot be annotated using simple sequence similarity based methods. © 2009 Springer Berlin Heidelberg.
CITATION STYLE
Tedder, P. M. R., Bradford, J. R., Needham, C. J., McConkey, G. A., Bulpitt, A. J., & Westhead, D. R. (2009). Bayesian data integration and enrichment analysis for predicting gene function in malaria. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5635 LNCS, pp. 457–466). https://doi.org/10.1007/978-3-642-03073-4_47
Mendeley helps you to discover research relevant for your work.