On the Underspecification of Situations in Open-domain Conversational Datasets

1Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.

Abstract

Advances of open-domain conversational systems have been achieved through the creation of numerous conversation datasets. However, many of the commonly used datasets contain little or no information about the conversational situation, such as relevant objects/people, their properties, and relationships. This absence leads to underspecification of the problem space and typically results in undesired dialogue system behavior. This position paper discusses the current state of the field associated with processing situational information. An analysis of response generation using three datasets shows that explicitly provided situational information can improve the coherence and specificity of generated responses, but further experiments reveal that generation systems can be misled by irrelevant information. Our conclusions from this evaluation provide insights into the problem and directions for future research.

Cite

CITATION STYLE

APA

Otani, N., Araki, J., Kim, H. S., & Hovy, E. (2023). On the Underspecification of Situations in Open-domain Conversational Datasets. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 12–28). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.nlp4convai-1.2

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free