QueryForm: A Simple Zero-shot Form Entity Query Framework

3Citations
Citations of this article
25Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Zero-shot transfer learning for document understanding is a crucial yet under-investigated scenario to help reduce the high cost involved in annotating document entities. We present a novel query-based framework, QueryForm, that extracts entity values from form-like documents in a zero-shot fashion. QueryForm contains a dual prompting mechanism that composes both the document schema and a specific entity type into a query, which is used to prompt a Transformer model to perform a single entity extraction task. Furthermore, we propose to leverage large-scale query-entity pairs generated from form-like webpages with weak HTML annotations to pre-train QueryForm. By unifying pre-training and fine-tuning into the same query-based framework, QueryForm enables models to learn from structured documents containing various entities and layouts, leading to better generalization to target document types without the need for target-specific training data. QueryForm sets new state-of-the-art average F1 score on both the XFUND (+4.6%∼10.1%) and the Payment (+3.2%∼9.5%) zero-shot benchmark, with a smaller model size and no additional image input.

Cite

CITATION STYLE

APA

Wang, Z., Zhang, Z., Devlin, J., Lee, C. Y., Su, G., Zhang, H., … Pfister, T. (2023). QueryForm: A Simple Zero-shot Form Entity Query Framework. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 4146–4159). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-acl.255

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free