Strategy and Policy Learning for Non-Task-Oriented Conversational Systems

Zhou Yu; Ziyu Xu; Alan W. Black; Alex I. Rudnicky

Conference Proceedings

Strategy and Policy Learning for Non-Task-Oriented Conversational Systems

SIGDIAL 2016 - 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference (2016) 404-412

DOI: 10.18653/v1/w16-3649

84Citations

161Readers

Get full text

Abstract

We propose a set of generic conversational strategies to handle possible system breakdowns in non-task-oriented dialog systems. We also design policies to select these strategies according to dialog context. We combine expert knowledge and the statistical findings derived from data in designing these policies. The policy learned via reinforcement learning outperforms the random selection policy and the locally greedy policy in both simulated and real-world settings. In addition, we propose three metrics for conversation quality evaluation which consider both the local and global quality of the conversation.

Cite

CITATION STYLE

APA

Yu, Z., Xu, Z., Black, A. W., & Rudnicky, A. I. (2016). Strategy and Policy Learning for Non-Task-Oriented Conversational Systems. In SIGDIAL 2016 - 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference (pp. 404–412). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w16-3649

Strategy and Policy Learning for Non-Task-Oriented Conversational Systems

Abstract

Cite

Register to see more suggestions