Sub-domain modelling for dialogue management with hierarchical reinforcement learning

39Citations
Citations of this article
131Readers
Mendeley users who have this article in their library.

Abstract

Human conversation is inherently complex, often spanning many different topics/domains. This makes policy learning for dialogue systems very challenging. Standard flat reinforcement learning methods do not provide an efficient framework for modelling such dialogues. In this paper, we focus on the under-explored problem of multi-domain dialogue management. First, we propose a new method for hierarchical reinforcement learning using the option framework. Next, we show that the proposed architecture learns faster and arrives at a better policy than the existing flat ones do. Moreover, we show how pretrained policies can be adapted to more complex systems with an additional set of new actions. In doing that, we show that our approach has the potential to facilitate policy optimisation for more sophisticated multi-domain dialogue systems.

Cite

CITATION STYLE

APA

Budzianowski, P., Ultes, S., Su, P. H., Mrkšić, N., Wen, T. H., Casanueva, I., … Gašić, M. (2017). Sub-domain modelling for dialogue management with hierarchical reinforcement learning. In SIGDIAL 2017 - 18th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference (pp. 86–92). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w17-5512

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free