Survey on evaluation methods for dialogue systems

Jan Deriu; Alvaro Rodrigo; Arantxa Otegi; Guillermo Echegoyen; Sophie Rosset; Eneko Agirre; Mark Cieliebak

Journal ArticleOPEN ACCESS

Survey on evaluation methods for dialogue systems

Artificial Intelligence Review (2021) 54(1) 755-810

DOI: 10.1007/s10462-020-09866-x

212Citations

410Readers

Abstract

In this paper, we survey the methods and concepts developed for the evaluation of dialogue systems. Evaluation, in and of itself, is a crucial part during the development process. Often, dialogue systems are evaluated by means of human evaluations and questionnaires. However, this tends to be very cost- and time-intensive. Thus, much work has been put into finding methods which allow a reduction in involvement of human labour. In this survey, we present the main concepts and methods. For this, we differentiate between the various classes of dialogue systems (task-oriented, conversational, and question-answering dialogue systems). We cover each class by introducing the main technologies developed for the dialogue systems and then present the evaluation methods regarding that class.

Author supplied keywords

Cite

CITATION STYLE

APA

Deriu, J., Rodrigo, A., Otegi, A., Echegoyen, G., Rosset, S., Agirre, E., & Cieliebak, M. (2021). Survey on evaluation methods for dialogue systems. Artificial Intelligence Review, 54(1), 755–810. https://doi.org/10.1007/s10462-020-09866-x

Survey on evaluation methods for dialogue systems

Abstract

Author supplied keywords

Cite

Register to see more suggestions