Knowledge-Empowered Representation Learning for Chinese Medical Reading Comprehension: Task, Model and Resources

3Citations
Citations of this article
65Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Machine Reading Comprehension (MRC) aims to extract answers to questions given a passage, which has been widely studied recently especially in open domains. However, few efforts have been made on closed-domain MRC, mainly due to the lack of large-scale training data. In this paper, we introduce a multi-target MRC task for the medical domain, whose goal is to predict answers to medical questions and the corresponding support sentences from medical information sources simultaneously, in order to ensure the high reliability of medical knowledge serving. A high-quality dataset (more than 18k samples) is manually constructed for the purpose, named Multi-task Chinese Medical MRC dataset (CMedMRC), with detailed analysis conducted. We further propose a Chinese medical BERT model for the task (CMedBERT), which fuses medical knowledge into pre-trained language models by the dynamic fusion mechanism of heterogeneous features and the multi-task learning strategy. Experiments show that CMedBERT consistently outperforms strong baselines by fusing context-aware and knowledge-aware token representations.

Cite

CITATION STYLE

APA

Zhang, T., Wang, C., Qiu, M., Yang, B., Cai, Z., He, X., & Huang, J. (2021). Knowledge-Empowered Representation Learning for Chinese Medical Reading Comprehension: Task, Model and Resources. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (pp. 2237–2249). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.findings-acl.197

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free