Zero-shot Faithfulness Evaluation for Text Summarization with Foundation Language Model

0Citations
Citations of this article
16Readers
Mendeley users who have this article in their library.

Abstract

Despite tremendous improvements in natural language generation, summarization models still suffer from the unfaithfulness issue. Previous work evaluates faithfulness either using models trained on the other tasks or in-domain synthetic data, or prompting a large model such as ChatGPT. This paper proposes to do zero-shot faithfulness evaluation simply with a moderately-sized foundation language model. We introduce a new metric FFLM, which is a combination of probability changes based on the intuition that prefixing a piece of text that is consistent with the output will increase the probability of predicting the output. Experiments show that FFLM performs competitively with or even outperforms ChatGPT on both inconsistency detection and faithfulness rating with 24x fewer parameters. FFLM also achieves improvements over other strong baselines.

References Powered by Scopus

Survey of Hallucination in Natural Language Generation

1666Citations
N/AReaders
Get full text

Abstractive text summarization using sequence-to-sequence RNNs and beyond

1430Citations
N/AReaders
Get full text

The balanced accuracy and its posterior distribution

1282Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Jia, Q., Ren, S., Liu, Y., & Zhu, K. Q. (2023). Zero-shot Faithfulness Evaluation for Text Summarization with Foundation Language Model. In EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 11017–11031). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.emnlp-main.679

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 4

67%

Lecturer / Post doc 1

17%

Researcher 1

17%

Readers' Discipline

Tooltip

Computer Science 8

80%

Medicine and Dentistry 1

10%

Economics, Econometrics and Finance 1

10%

Save time finding and organizing research with Mendeley

Sign up for free