A POMDP Dialogue Policy with 3-way Grounding and Adaptive Sensing for Learning through Communication

0Citations
Citations of this article
16Readers
Mendeley users who have this article in their library.

Abstract

Agents to assist with rescue, surgery, and similar activities could collaborate better with humans if they could learn new strategic behaviors through communication. We introduce a novel POMDP dialogue policy for learning from people. The policy has 3-way grounding of language in the shared physical context, the dialogue context, and persistent knowledge. It can learn distinct but related games, and can continue learning across dialogues for complex games. A novel sensing component supports adaptation to information-sharing differences across people. The single policy performs better than oracle policies customized to specific games and information behavior.

Cite

CITATION STYLE

APA

Zare, M., Wagner, A. R., & Passonneau, R. J. (2022). A POMDP Dialogue Policy with 3-way Grounding and Adaptive Sensing for Learning through Communication. In Findings of the Association for Computational Linguistics: EMNLP 2022 (pp. 6796–6809). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.findings-emnlp.504

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free