In the last few years, Automatic Question Generation (AQG) has attracted increasing interest. In this paper we survey the evaluation methodologies used in AQG. Based on a sample of 37 papers, our study shows that advances in AQG systems have not been matched by comparable advances in the methodologies used to evaluate them. Indeed, across the papers we examine, we find a wide variety of both intrinsic and extrinsic evaluation methodologies. Such diverse evaluation practices make it difficult to reliably compare the quality of different generation systems. Our study suggests that, given the rapidly increasing level of research in the area, a common framework is urgently needed for comparing the performance of AQG systems, and of NLG systems more generally.
CITATION STYLE
Amidei, J., Piwek, P., & Willis, A. (2018). Evaluation methodologies in automatic question generation 2013-2018. In INLG 2018 - 11th International Natural Language Generation Conference, Proceedings of the Conference (pp. 307–317). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w18-6537