GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning

Jianfeng Liu; Feiyang Pan; Ling Luo

Conference ProceedingsOPEN ACCESS

GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning

SIGIR 2020 - Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (2020) 1793-1796

DOI: 10.1145/3397271.3401250

24Citations

53Readers

Get full text

Abstract

A chatbot that converses like a human should be goal-oriented (i.e., be purposeful in conversation), which is beyond language generation. However, existing goal-oriented dialogue systems often heavily rely on cumbersome hand-crafted rules or costly labelled datasets, which limits the applicability. In this paper, we propose Goal-oriented Chatbots (GoChat), a framework for end-to-end training the chatbot to maximize the long-term return from offline multi-turn dialogue datasets. Our framework utilizes hierarchical reinforcement learning (HRL), where the high-level policy determines some sub-goals to guide the conversation towards the final goal, and the low-level policy fulfills the sub-goals by generating the corresponding utterance for response. In our experiments conducted on a real-world dialogue dataset for anti-fraud in financial, our approach outperforms previous methods on both the quality of response generation as well as the success rate of accomplishing the goal.

Author supplied keywords

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Liu, J., Pan, F., & Luo, L. (2020). GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning. In SIGIR 2020 - Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 1793–1796). Association for Computing Machinery, Inc. https://doi.org/10.1145/3397271.3401250

Readers over time

Readers' Seniority

PhD / Post grad / Masters / Doc 15

65%

Researcher 7

30%

Lecturer / Post doc 1

Readers' Discipline

Computer Science 20

71%

Engineering 3

11%

Business, Management and Accounting 3

11%

Neuroscience 2

GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning

Abstract

Author supplied keywords

References Powered by Scopus

Hierarchical attention networks for document classification

A diversity-promoting objective function for neural conversation models

How to make context more useful? An empirical study on context-Aware neural conversational models

Cited by Powered by Scopus

A knowledge infused context driven dialogue agent for disease diagnosis using hierarchical reinforcement learning

Multimodal Dialog System: Relational Graph-based Context-aware Question Understanding

Error Correction and Adaptation in Conversational AI: A Review of Techniques and Applications in Chatbots

Register to see more suggestions

Cite

Readers over time

Readers' Seniority

Readers' Discipline