On the Underspecification of Situations in Open-domain Conversational Datasets

Naoki Otani; Jun Araki; Hyeong Sik Kim; Eduard Hovy

Conference Proceedings, Open Access

Proceedings of the Annual Meeting of the Association for Computational Linguistics (2023) 12-28

DOI: 10.18653/v1/2023.nlp4convai-1.2

Abstract

Advances in open-domain conversational systems have been achieved through the creation of numerous conversation datasets. However, many of the commonly used datasets contain little or no information about the conversational situation, such as relevant objects/people, their properties, and relationships. This absence leads to underspecification of the problem space and typically results in undesired dialogue system behavior. This position paper discusses the current state of the field associated with processing situational information. An analysis of response generation using three datasets shows that explicitly provided situational information can improve the coherence and specificity of generated responses, but further experiments reveal that generation systems can be misled by irrelevant information. Our conclusions from this evaluation provide insights into the problem and directions for future research.

Citation (APA)

Otani, N., Araki, J., Kim, H. S., & Hovy, E. (2023). On the Underspecification of Situations in Open-domain Conversational Datasets. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 12–28). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.nlp4convai-1.2