Dialogue response generation requires an agent to produce a response conditioned on the current dialogue history. Two-party dialogues have been well studied in this setting, while multi-party dialogues remain largely under-explored. Unlike two-party dialogues, where each response is a direct reply to the previous utterance, in the multi-party scenario the addressee of a response must be specified before the response is generated. Thanks to the huge amount of two-party conversational data, various pre-trained language models for two-party dialogue response generation have been proposed. However, because multi-party dialogue datasets lack annotated addressee labels, it is hard to use them to pre-train a response generation model for multi-party dialogues. To overcome this obstacle, we propose an Expectation-Maximization (EM) approach that iteratively performs expectation steps to generate addressee labels and maximization steps to optimize a response generation model. Theoretical analyses and extensive experiments justify the feasibility and effectiveness of the proposed method. The official implementation of this paper is available at https://github.com/EricLee8/MPDRG.
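The sketch below illustrates the general shape of such an EM loop over latent addressee labels: an E-step that assigns each unlabeled response its most plausible addressee under the current model, and an M-step that re-fits the model on those inferred labels. All names and the toy unigram "model" are hypothetical stand-ins for illustration only; the paper's actual method uses a pre-trained neural response generator (see the official repository above).

```python
# Minimal hard-EM sketch over latent addressee labels (illustrative only;
# not the authors' implementation). The toy model scores a response under a
# hypothesised addressee via per-addressee word counts.

from collections import Counter, defaultdict

class ToyResponseModel:
    """Stand-in for a response generator conditioned on the addressee."""

    def __init__(self):
        self.counts = defaultdict(Counter)

    def score(self, addressee, response):
        # Higher score = response words seen more often with this addressee.
        c = self.counts[addressee]
        total = sum(c.values()) + 1
        return sum((c[w] + 1) / total for w in response.split())

    def fit(self, labeled):
        # Re-estimate per-addressee statistics from (addressee, response) pairs.
        self.counts = defaultdict(Counter)
        for addressee, response in labeled:
            self.counts[addressee].update(response.split())

def em_pretrain(model, dialogues, iterations=3):
    for _ in range(iterations):
        # E-step: assign each response its most plausible addressee.
        labels = [
            max(d["candidates"], key=lambda a: model.score(a, d["response"]))
            for d in dialogues
        ]
        # M-step: re-train the model on the newly labeled data.
        model.fit(list(zip(labels, (d["response"] for d in dialogues))))
    return model

# Tiny usage example with hypothetical multi-party data.
dialogues = [
    {"candidates": ["alice", "bob"], "response": "thanks alice that fixed it"},
    {"candidates": ["alice", "bob"], "response": "bob can you rebase first"},
]
model = em_pretrain(ToyResponseModel(), dialogues)
print(sorted(model.counts))
```

In the paper's setting, the scoring and fitting steps would be replaced by the likelihood of a neural response generator and gradient-based training, respectively; the alternation between inferring addressee labels and optimizing the generator is the part the sketch is meant to convey.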
CITATION STYLE
Li, Y., & Zhao, H. (2023). EM Pre-training for Multi-party Dialogue Response Generation. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1, pp. 92–103). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.acl-long.7