We propose a shared task on methodologies and algorithms for evaluating the accuracy of generated texts, specifically summaries of basketball games produced from basketball box score and other game data. We welcome submissions based on protocols for human evaluation, automatic metrics, as well as combinations of human evaluations and metrics.
CITATION STYLE
Reiter, E., & Thomson, C. (2020). Shared Task on Evaluating Accuracy. In INLG 2020 - 13th International Conference on Natural Language Generation, Proceedings (pp. 227–231). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.inlg-1.28
Mendeley helps you to discover research relevant for your work.