Divide Rows and Conquer Cells: Towards Structure Recognition for Large Tables

10Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.

Abstract

Recent advanced Table Structure Recognition (TSR) models adopt image-to-text solutions to parse table structure. These methods can be formulated as image caption problem, i.e., input a single-table image and output table structure description in a specific text format, e.g., HTML. With the impressive success of Transformer in text generation tasks, these methods use Transformer architecture to predict HTML table text in an autoregressive manner. However, tables always emerge with a large variety of shapes and sizes. Autoregressive models usually suffer from the error accumulation problem as the length of predicted text increases, which results in unsatisfactory performance for large tables. In this paper, we propose a novel image-to-text based TSR method that relieves error accumulation problems and improves performance noticeably. At the core of our method is a cascaded two-step decoder architecture with the former decoder predicting HTML table row tags non-autoregressively and the latter predicting HTML table cell tags of each row in a semi-autoregressive manner. Compared with existing methods that predict HTML text autoregressively, the superiority of our row-to-cell progressive table parsing is twofold: (i) it generates an HTML tag sequence with a vertical-and-horizontal two-step 'scanning', which better fits the inherent 2D structure of image data, (ii) it performs substantially better for large tables (long sequence prediction) since it alleviates error accumulation problem specific to autoregressive models. Extensive experiments demonstrate that our method achieves competitive performance on three public benchmarks.

Cite

CITATION STYLE

APA

Shen, H., Gao, X., Wei, J., Qiao, L., Zhou, Y., Li, Q., & Cheng, Z. (2023). Divide Rows and Conquer Cells: Towards Structure Recognition for Large Tables. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2023-August, pp. 1369–1377). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2023/152

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free