Recently, various response generation models for two-party conversations have achieved impressive improvements, but less effort has been paid to multi-party conversations (MPCs) which are more practical and complicated. Compared with a two-party conversation where a dialogue context is a sequence of utterances, building a response generation model for MPCs is more challenging, since there exist complicated context structures and the generated responses heavily rely on both interlocutors (i.e., speaker and addressee) and history utterances. To address these challenges, we present HeterMPC, a heterogeneous graph-based neural network for response generation in MPCs which models the semantics of utterances and interlocutors simultaneously with two types of nodes in a graph. Besides, we also design six types of meta relations with node-edge-type-dependent parameters to characterize the heterogeneous interactions within the graph. Through multi-hop updating, HeterMPC can adequately utilize the structural knowledge of conversations for response generation. Experimental results on the Ubuntu Internet Relay Chat (IRC) channel benchmark show that HeterMPC outperforms various baseline models for response generation in MPCs.
CITATION STYLE
Gu, J. C., Tan, C. H., Tao, C., Ling, Z. H., Hu, H., Geng, X., & Jiang, D. (2022). HETERMPC: A Heterogeneous Graph Neural Network for Response Generation in Multi-Party Conversations. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1, pp. 5086–5097). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.acl-long.349
Mendeley helps you to discover research relevant for your work.