Nutri-bullets Hybrid: Consensual Multi-document Summarization

Darsh J. Shah; Lili Yu; Tao Lei; Regina Barzilay

Conference Proceedings

Nutri-bullets Hybrid: Consensual Multi-document Summarization

NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (2021) 5213-5222

DOI: 10.18653/v1/2021.naacl-main.411

8Citations

60Readers

Get full text

Abstract

We present a method for generating comparative summaries that highlights similarities and contradictions in input documents. The key challenge in creating such summaries is the lack of large parallel training data required for training typical summarization systems. To this end, we introduce a hybrid generation approach inspired by traditional concept-to-text systems. To enable accurate comparison between different sources, the model first learns to extract pertinent relations from input documents. The content planning component uses deterministic operators to aggregate these relations after identifying a subset for inclusion into a summary. The surface realization component lexicalizes this information using a text-infilling language model. By separately modeling content selection and realization, we can effectively train them with limited annotations. We implemented and tested the model in the domain of nutrition and health – rife with inconsistencies. Compared to conventional methods, our framework leads to more faithful, relevant and aggregation-sensitive summarization – while being equally fluent.

Cite

CITATION STYLE

APA

Shah, D. J., Yu, L., Lei, T., & Barzilay, R. (2021). Nutri-bullets Hybrid: Consensual Multi-document Summarization. In NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (pp. 5213–5222). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.naacl-main.411

Nutri-bullets Hybrid: Consensual Multi-document Summarization

Abstract

Cite

Register to see more suggestions