Domain-independent User Simulation with Transformers for Task-oriented Dialogue Systems

23Citations
Citations of this article
63Readers
Mendeley users who have this article in their library.

Abstract

Dialogue policy optimisation via reinforcement learning requires a large number of training interactions, which makes learning with real users time consuming and expensive. Many set-ups therefore rely on a user simulator instead of humans. These user simulators have their own problems. While hand-coded, rule-based user simulators have been shown to be sufficient in small, simple domains, for complex domains the number of rules quickly becomes intractable. State-of-the-art datadriven user simulators, on the other hand, are still domain-dependent. This means that adaptation to each new domain requires redesigning and retraining. In this work, we propose a domain-independent transformer-based user simulator (TUS). The structure of our TUS is not tied to a specific domain, enabling domain generalisation and learning of cross-domain user behaviour from data. We compare TUS with the state of the art using automatic as well as human evaluations. TUS can compete with rule-based user simulators on pre-defined domains and is able to generalise to unseen domains in a zero-shot fashion.

Cite

CITATION STYLE

APA

Lin, H. C., Lubis, N., Hu, S., Van Niekerk, C., Geishauser, C., Heck, M., … Gašić, M. (2021). Domain-independent User Simulation with Transformers for Task-oriented Dialogue Systems. In SIGDIAL 2021 - 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference (pp. 445–456). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.sigdial-1.47

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free