M3: Multi-level dataset for Multi-document summarization of Medical studies

5Citations
Citations of this article
38Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We present M3 (Multi-level dataset for Multi-document summarisation of Medical studies), a benchmark dataset for evaluating the quality of summarisation systems in the biomedical domain. The dataset contains sets of multiple input documents and target summaries of three levels of complexity: documents, sentences, and propositions. The dataset also includes several levels of annotation, including biomedical entities, direction, and strength of relations between them, and the discourse relationships between the input documents (“contradiction” or “agreement”). We showcase usage scenarios of the dataset by testing 10 generic and domain-specific summarisation models in a zero-shot setting, and introduce a probing task based on counterfactuals to test if models are aware of the direction and strength of the conclusions generated from input studies.

Cite

CITATION STYLE

APA

Otmakhova, Y., Verspoor, K., Baldwin, T., Yepes, A. J., & Lau, J. H. (2022). M3: Multi-level dataset for Multi-document summarization of Medical studies. In Findings of the Association for Computational Linguistics: EMNLP 2022 (pp. 3916–3930). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.findings-emnlp.222

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free