A Pilot Study on Annotation Interfaces for Summary Comparisons

Sian Gooding; Lucas Werner; Victor Cărbune

Conference Proceedings

Proceedings of the Annual Meeting of the Association for Computational Linguistics (2023) 179-187

ISSN: 0736587X

2 Citations

15 Readers

Abstract

The task of summarisation is notoriously difficult to evaluate, with agreement even between expert raters unlikely to be perfect. One technique for summary evaluation relies on collecting comparison data by presenting annotators with generated summaries and tasking them with selecting the best one. This paradigm is currently being exploited in reinforcement learning using human feedback, whereby a reward function is trained using pairwise choice data. Comparisons are an easier way to elicit human feedback for summarisation; however, such decisions can be bottlenecked by the usability of the annotator interface. In this paper, we present the results of a pilot study exploring how the user interface impacts annotator agreement when judging summary quality.

Cite

APA

Gooding, S., Werner, L., & Cărbune, V. (2023). A Pilot Study on Annotation Interfaces for Summary Comparisons. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 179–187). Association for Computational Linguistics (ACL).
