Improving interaction quality estimation with BiLSTMs and the impact on dialogue policy learning

Stefan Ultes

Conference Proceedings (Open Access)

SIGDIAL 2019 - 20th Annual Meeting of the Special Interest Group on Discourse and Dialogue - Proceedings of the Conference (2019) 11-20

DOI: 10.18653/v1/w19-5902

13 Citations

67 Readers

Abstract

Learning suitable, well-performing dialogue behaviour in statistical spoken dialogue systems has been a focus of research for many years. While most work based on reinforcement learning models the reward signal with an objective measure such as task success, we use a reward based on user satisfaction estimation. We propose a novel estimator and show that it outperforms all previous estimators while learning temporal dependencies implicitly. Furthermore, we apply this user satisfaction estimation model live in simulated experiments in which the model is trained on one domain and applied in many other domains that cover a similar task. We show that applying this model results in higher estimated satisfaction, similar task success rates, and higher robustness to noise.

Cite

APA

Ultes, S. (2019). Improving interaction quality estimation with BiLSTMs and the impact on dialogue policy learning. In SIGDIAL 2019 - 20th Annual Meeting of the Special Interest Group on Discourse and Dialogue - Proceedings of the Conference (pp. 11–20). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w19-5902
