Advances in open-domain conversational systems have been achieved through the creation of numerous conversation datasets. However, many of the commonly used datasets contain little or no information about the conversational situation, such as relevant objects/people, their properties, and relationships. This absence leaves the problem space underspecified and typically results in undesired dialogue system behavior. This position paper discusses the current state of the field with respect to processing situational information. An analysis of response generation using three datasets shows that explicitly provided situational information can improve the coherence and specificity of generated responses, but further experiments reveal that generation systems can be misled by irrelevant information. Our conclusions from this evaluation provide insights into the problem and directions for future research.
CITATION STYLE
Otani, N., Araki, J., Kim, H. S., & Hovy, E. (2023). On the Underspecification of Situations in Open-domain Conversational Datasets. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 12–28). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.nlp4convai-1.2