OpenPI-C: A Better Benchmark and Stronger Baseline for Open-Vocabulary State Tracking

1Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Open-vocabulary state tracking is a more practical version of state tracking that aims to track state changes of entities throughout a process without restricting the state space and entity space. OpenPI (Tandon et al., 2020) is to date the only dataset annotated for open-vocabulary state tracking. However, we identify issues with the dataset quality and evaluation metric. For the dataset, we categorize 3 types of problems on the procedure level, step level and state change level respectively, and build a clean dataset OpenPI-C using multiple rounds of human judgment. For the evaluation metric, we propose a cluster-based metric to fix the original metric's preference for repetition. Model-wise, we enhance the seq2seq generation baseline by reinstating two key properties for state tracking: temporal dependency and entity awareness. The state of the world after an action is inherently dependent on the previous state. We model this dependency through a dynamic memory bank and allow the model to attend to the memory slots during decoding. On the other hand, the state of the world is naturally a union of the states of involved entities. Since the entities are unknown in the open-vocabulary setting, we propose a two-stage model that refines the state change prediction conditioned on entities predicted from the first stage. Empirical results show the effectiveness of our proposed model especially on the cluster-based metric. The code and data are released at https://github.com/shirley-wu/openpi-c.

Cite

CITATION STYLE

APA

Wu, X., Li, S., & Ji, H. (2023). OpenPI-C: A Better Benchmark and Stronger Baseline for Open-Vocabulary State Tracking. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 7213–7222). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-acl.452

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free