A global network of biomedical relationships derived from text

89Citations
Citations of this article
142Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Motivation: The biomedical community's collective understanding of how chemicals, genes and phenotypes interact is distributed across the text of over 24million research articles. These interactions offer insights into the mechanisms behind higher order biochemical phenomena, such as drug-drug interactions and variations in drug response across individuals. To assist their curation at scale, we must understand what relationship types are possible and map unstructured natural language descriptions onto these structured classes. We used NCBI's PubTator annotations to identify instances of chemical, gene and disease names in Medline abstracts and applied the Stanford dependency parser to find connecting dependency paths between pairs of entities in single sentences. We combined a published ensemble biclustering algorithm (EBC) with hierarchical clustering to group the dependency paths into semantically-related categories, which we annotated with labels, or 'themes' ('inhibition' and 'activation', for example). We evaluated our theme assignments against six human-curated databases: DrugBank, Reactome, SIDER, the Therapeutic Target Database, OMIM and PharmGKB. Results: Clustering revealed 10 broad themes for chemical-gene relationships, 7 for chemicaldisease, 10 for gene-disease and 9 for gene-gene. In most cases, enriched themes corresponded directly to known database relationships. Our final dataset, represented as a network, contained 37 491 thematically-labeled chemical-gene edges, 2 021 192 chemical-disease edges, 136 206 genedisease edges and 41 418 gene-gene edges, each representing a single-sentence description of an interaction from somewhere in the literature.

Cite

CITATION STYLE

APA

Percha, B., & Altman, R. B. (2018). A global network of biomedical relationships derived from text. Bioinformatics, 34(15), 2614–2624. https://doi.org/10.1093/bioinformatics/bty114

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free