Hubs of knowledge: Using the functional link structure in Biozon to mine for biologically significant entities

12Citations
Citations of this article
16Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Background: Existing biological databases support a variety of queries such as keyword or definition search. However, they do not provide any measure of relevance for the instances reported, and result sets are usually sorted arbitrarily. Results: We describe a system that builds upon the complex infrastructure of the Biozon database and applies methods similar to those of Google to rank documents that match queries. We explore different prominence models and study the spectral properties of the corresponding data graphs. We evaluate the information content of principal and non-principal eigenspaces, and test various scoring functions which combine contributions from multiple eigenspaces. We also test the effect of similarity data and other variations which are unique to the biological knowledge domain on the quality of the results. Query result sets are assessed using a probabilistic approach that measures the significance of coherence between directly connected nodes in the data graph. This model allows us, for the first time, to compare different prominence models quantitatively and effectively and to observe unique trends. Conclusion: Our tests show that the ranked query results outperform unsorted results with respect to our significance measure and the top ranked entities are typically linked to many other biological entities. Our study resulted in a working ranking system of biological entities that was integrated into Biozon at http://biozon.org. © 2006 Shafer et al; licensee BioMed Central Ltd.

References Powered by Scopus

The meaning and use of the area under a receiver operating characteristic (ROC) curve

17814Citations
N/AReaders
Get full text

KEGG: Kyoto encyclopedia of genes and genomes

4119Citations
N/AReaders
Get full text

A new status index derived from sociometric analysis

2851Citations
N/AReaders
Get full text

Cited by Powered by Scopus

BIOZON: A system for unification, management and analysis of heterogeneous biological data

72Citations
N/AReaders
Get full text

The structure of collaboration in the Journal of Finance

65Citations
N/AReaders
Get full text

Authority-based keyword search in databases

60Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Shafer, P., Isganitis, T., & Yona, G. (2006). Hubs of knowledge: Using the functional link structure in Biozon to mine for biologically significant entities. BMC Bioinformatics, 7. https://doi.org/10.1186/1471-2105-7-71

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 9

60%

Researcher 6

40%

Readers' Discipline

Tooltip

Computer Science 7

47%

Agricultural and Biological Sciences 5

33%

Mathematics 2

13%

Medicine and Dentistry 1

7%

Save time finding and organizing research with Mendeley

Sign up for free