ChiMed: A Chinese medical corpus for question answering

34Citations
Citations of this article
90Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Question answering (QA) is a challenging task in natural language processing (NLP), especially when it is applied to specific domains. While models trained in the general domain can be adapted to a new target domain, their performance often degrades significantly due to domain mismatch. Alternatively, one can require a large amount of domain-specific QA data, but such data are rare, especially for the medical domain. In this study, we first collect a large-scale Chinese medical QA corpus called ChiMed; second we annotate a small fraction of the corpus to check the quality of the answers; third, we extract two datasets from the corpus and use them for the relevancy prediction task and the adoption prediction task. Several benchmark models are applied to the datasets, producing good results for both tasks.

Cite

CITATION STYLE

APA

Tian, Y., Ma, W., Xia, F., & Song, Y. (2019). ChiMed: A Chinese medical corpus for question answering. In BioNLP 2019 - SIGBioMed Workshop on Biomedical Natural Language Processing, Proceedings of the 18th BioNLP Workshop and Shared Task (pp. 250–260). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w19-5027

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free