QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization

Ming Zhong; Da Yin; Tao Yu; Ahmad Zaidi; Mutethia Mutuma; Rahul Jha; Ahmed Hassan Awadallah; Asli Celikyilmaz; Yang Liu; Xipeng Qiu; Dragomir Radev

Conference ProceedingsOPEN ACCESS

QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization

NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (2021) 5905-5921

DOI: 10.18653/v1/2021.naacl-main.472

178Citations

156Readers

Abstract

Meetings are a key component of human collaboration. As increasing numbers of meetings are recorded and transcribed, meeting summaries have become essential to remind those who may or may not have attended the meetings about the key decisions made and the tasks to be completed. However, it is hard to create a single short summary that covers all the content of a long meeting involving multiple people and topics. In order to satisfy the needs of different types of users, we define a new query-based multi-domain meeting summarization task, where models have to select and summarize relevant spans of meetings in response to a query, and we introduce QMSum, a new benchmark for this task. QMSum consists of 1,808 query-summary pairs over 232 meetings in multiple domains. Besides, we investigate a locate-then-summarize method and evaluate a set of strong summarization baselines on the task. Experimental results and manual analysis reveal that QMSum presents significant challenges in long meeting summarization for future research. Dataset is available at https://github.com/Yale-LILY/QMSum.

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Zhong, M., Yin, D., Yu, T., Zaidi, A., Mutuma, M., Jha, R., … Radev, D. (2021). QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization. In NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (pp. 5905–5921). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.naacl-main.472

Readers over time

Readers' Seniority

PhD / Post grad / Masters / Doc 42

75%

Researcher 7

13%

Lecturer / Post doc 5

Professor / Associate Prof. 2

Readers' Discipline

Computer Science 59

84%

Linguistics 5

Engineering 4

Neuroscience 2

Article Metrics

Mentions

News Mentions: 1

View details >

QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization

Abstract

References Powered by Scopus

Convolutional neural networks for sentence classification

Get to the point: Summarization with pointer-generator networks

Abstractive text summarization using sequence-to-sequence RNNs and beyond

Cited by Powered by Scopus

Survey of Hallucination in Natural Language Generation

DIALOGLM: Pre-trained Model for Long Dialogue Understanding and Summarization

An Empirical Survey on Long Document Summarization: Datasets, Models, and Metrics

Register to see more suggestions

Cite

Readers over time

Readers' Seniority

Readers' Discipline

Article Metrics