Deep Reinforcement Learning for On-line Dialogue State Tracking


Abstract

Dialogue state tracking (DST) is a crucial module in dialogue management. It is usually cast as a supervised learning problem, which is not convenient for on-line optimization. In this paper, a novel companion-teaching-based deep reinforcement learning (DRL) framework for on-line DST optimization is proposed. To the best of our knowledge, this is the first effort to optimize the DST module within a DRL framework for on-line task-oriented spoken dialogue systems. In addition, the dialogue policy can be jointly updated. Experiments show that on-line DST optimization effectively improves dialogue manager performance while retaining the flexibility of using a predefined policy, and that joint training of DST and policy improves performance further.
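The paper's companion-teaching framework is not reproduced here, but the core idea the abstract describes, updating a belief tracker's parameters from dialogue-level reward rather than supervised labels, can be illustrated with a toy REINFORCE update. Everything below (the linear tracker, the noisy single-turn environment, the reward) is an illustrative assumption, not the authors' implementation:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy "DST": a linear tracker scoring 2 candidate slot values from a noisy
# feature vector, trained with a per-turn reward (a stand-in for on-line
# dialogue success signal) instead of supervised state labels.
W = np.zeros((2, 2))  # tracker parameters: one score row per candidate value

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

lr = 0.5
for step in range(2000):
    true_value = rng.integers(2)                          # hidden correct slot value
    x = np.eye(2)[true_value] + 0.3 * rng.normal(size=2)  # noisy observation
    probs = softmax(W @ x)                                # belief over candidate values
    a = rng.choice(2, p=probs)                            # sampled tracked state
    reward = 1.0 if a == true_value else 0.0              # success-style reward
    grad_log = (np.eye(2)[a] - probs)[:, None] * x[None, :]  # grad of log pi(a|x)
    W += lr * reward * grad_log                           # REINFORCE update

# Evaluate: the reward-trained tracker should now usually pick the right value.
correct = 0
for _ in range(500):
    true_value = rng.integers(2)
    x = np.eye(2)[true_value] + 0.3 * rng.normal(size=2)
    correct += int(np.argmax(W @ x) == true_value)
accuracy = correct / 500
print(accuracy)
```

The sketch uses plain policy-gradient RL; the paper's actual contribution (companion teaching, which mixes guidance from a predefined policy/teacher into the on-line updates, and joint DST-policy training) is not modeled here.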

Citation (APA)

Chen, Z., Chen, L., Zhou, X., & Yu, K. (2023). Deep Reinforcement Learning for On-line Dialogue State Tracking. In Communications in Computer and Information Science (Vol. 1765 CCIS, pp. 278–292). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-99-2401-1_25
