Empathy is a crucial factor in open-domain conversations, which naturally shows one's caring and understanding to others. Though several methods have been proposed to generate empathetic responses, existing works often lead to monotonous empathy that refers to generic and safe expressions. In this paper, we propose to use explicit control to guide the empathy expression and design a framework DIFFUSEMP based on conditional diffusion language model to unify the utilization of dialogue context and attribute-oriented control signals. Specifically, communication mechanism, intent, and semantic frame are imported as multi-grained signals that control the empathy realization from coarse to fine levels. We then design a specific masking strategy to reflect the relationship between multi-grained signals and response tokens, and integrate it into the diffusion model to influence the generative process. Experimental results on a benchmark dataset EMPATHETICDIALOGUE show that our framework outperforms competitive baselines in terms of controllability, informativeness, and diversity without the loss of context-relatedness.
CITATION STYLE
Bi, G., Shen, L., Cao, Y., Chen, M., Xie, Y., Lin, Z., & He, X. (2023). DIFFUSEMP: A Diffusion Model-Based Framework with Multi-Grained Control for Empathetic Response Generation. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1, pp. 2812–2831). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.acl-long.158
Mendeley helps you to discover research relevant for your work.