Pathway2Text: Dataset and Method for Biomedical Pathway Description Generation

2Citations
Citations of this article
32Readers
Mendeley users who have this article in their library.

Abstract

Biomedical pathways have been extensively used to characterize the mechanism of complex diseases. One essential step in biomedical pathway analysis is to curate the description of a pathway based on its graph structure and node features. Neural text generation could be a plausible technique to circumvent the tedious manual curation. In this paper, we propose a new dataset Pathway2Text, which contains 2,367 pairs of biomedical pathways and textual descriptions. All pathway graphs are experimentally derived or manually curated. All textual descriptions are written by domain experts. We form this problem as a Graph2Text task and propose a novel graph-based text generation approach kNNGraph2Text, which explicitly exploited descriptions of similar graphs to generate new descriptions. We observed substantial improvement of our method on both Graph2Text and the reverse task of Text2Graph. We further illustrated how our dataset can be used as a novel benchmark for biomedical named entity recognition. Collectively, we envision our method will become an important benchmark for evaluating Graph2Text methods and advance biomedical research for complex diseases.

References Powered by Scopus

Finding the missing heritability of complex diseases

6484Citations
N/AReaders
Get full text

KEGG: New perspectives on genomes, pathways, diseases and drugs

6135Citations
N/AReaders
Get full text

UniProt: the universal protein knowledgebase in 2021

4669Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Knowledge grounded medical dialogue generation using augmented graphs

18Citations
N/AReaders
Get full text

MetaFill: Text Infilling for Meta-Path Generation on Heterogeneous Information Networks

0Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Yang, J., Liu, Z., Zhang, M., & Wang, S. (2022). Pathway2Text: Dataset and Method for Biomedical Pathway Description Generation. In Findings of the Association for Computational Linguistics: NAACL 2022 - Findings (pp. 1441–1454). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.findings-naacl.108

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 5

45%

Researcher 4

36%

Professor / Associate Prof. 1

9%

Lecturer / Post doc 1

9%

Readers' Discipline

Tooltip

Computer Science 10

71%

Linguistics 2

14%

Neuroscience 1

7%

Agricultural and Biological Sciences 1

7%

Save time finding and organizing research with Mendeley

Sign up for free