Text mining for contexts and relationships in cancer genomics literature

2Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Motivation: Scientific advances build on the findings of existing research. The 2001 publication of the human genome has led to the production of huge volumes of literature exploring the context-specific functions and interactions of genes. Technology is needed to perform large-scale text mining of research papers to extract the reported actions of genes in specific experimental contexts and cell states, such as cancer, thereby facilitating the design of new therapeutic strategies. Results: We present a new corpus and Text Mining methodology that can accurately identify and extract the most important details of cancer genomics experiments from biomedical texts. We build a Named Entity Recognition model that accurately extracts relevant experiment details from PubMed abstract text, and a second model that identifies the relationships between them. This system outperforms earlier models and enables the analysis of gene function in diverse and dynamically evolving experimental contexts. Availability and implementation: Code and data are available here: https://github.com/cambridgeltl/functional-genomics-ie.

Cite

CITATION STYLE

APA

Collins, C., Baker, S., Brown, J., Zheng, H., Chan, A., Stenius, U., … Korhonen, A. (2024). Text mining for contexts and relationships in cancer genomics literature. Bioinformatics, 40(1). https://doi.org/10.1093/bioinformatics/btae021

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free