Exploring Schema Generalizability of Text-to-SQL

Jieyu Li; Lu Chen; Ruisheng Cao; Su Zhu; Hongshen Xu; Zhi Chen; Hanchong Zhang; Kai Yu

Proceedings of the Annual Meeting of the Association for Computational Linguistics (2023) 1344-1360

DOI: 10.18653/v1/2023.findings-acl.87

Abstract

Exploring the generalizability of a text-to-SQL parser is essential for a system to automatically adapt to real-world databases. Prior studies mostly focus on lexical diversity, including the influence of synonyms and perturbations in both natural language questions and databases. However, the structural variability of the database schema (DS), a scenario widely seen in the real world, remains underexplored. Specifically, for the same input question, the target SQL may take different forms when the DS has a different structure. In this work, we provide in-depth discussions of the schema generalizability challenge in text-to-SQL tasks. We observe that current datasets are too templated to study schema generalization. To collect suitable test data, we propose a framework that generates novel text-to-SQL data by automatically and synchronously altering (DS, SQL) pairs. When state-of-the-art text-to-SQL models are evaluated on the synthetic samples, their performance degrades significantly, which demonstrates the limitation of current research regarding schema generalization.
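
As an illustration of the structural variability described in the abstract, the following minimal Python sketch answers one question against two differently structured schemas. It is not the framework proposed in the paper. The question, table names, column names, and rows are all hypothetical and chosen only to show how the target SQL changes when a column is moved into its own table.

```python
import sqlite3

# Hypothetical question; not taken from the paper or any benchmark.
QUESTION = "What are the names of singers from France?"

# Schema A (assumed): the country is stored as a column of `singer`.
SCHEMA_A = """
CREATE TABLE singer (singer_id INTEGER PRIMARY KEY, name TEXT, country TEXT);
INSERT INTO singer VALUES (1, 'Alice', 'France'), (2, 'Bob', 'Japan'), (3, 'Chloe', 'France');
"""
SQL_A = "SELECT name FROM singer WHERE country = 'France' ORDER BY name"

# Schema B (assumed): the country is normalized into its own table,
# so the same question now requires a join.
SCHEMA_B = """
CREATE TABLE country (country_id INTEGER PRIMARY KEY, country_name TEXT);
CREATE TABLE singer (
    singer_id INTEGER PRIMARY KEY,
    name TEXT,
    country_id INTEGER REFERENCES country(country_id)
);
INSERT INTO country VALUES (10, 'France'), (20, 'Japan');
INSERT INTO singer VALUES (1, 'Alice', 10), (2, 'Bob', 20), (3, 'Chloe', 10);
"""
SQL_B = (
    "SELECT s.name FROM singer AS s "
    "JOIN country AS c ON s.country_id = c.country_id "
    "WHERE c.country_name = 'France' ORDER BY s.name"
)


def run(schema_script, query):
    """Build a fresh in-memory database and return the first column of the result."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.executescript(schema_script)
        return [row[0] for row in conn.execute(query)]
    finally:
        conn.close()


answers_a = run(SCHEMA_A, SQL_A)
answers_b = run(SCHEMA_B, SQL_B)
print(QUESTION)
print("Schema A:", answers_a)
print("Schema B:", answers_b)
```

Both queries return the same names, yet the SQL text differs. A parser that has only seen the single-table layout would need to produce the join form to stay correct on the second schema.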

Cite

Li, J., Chen, L., Cao, R., Zhu, S., Xu, H., Chen, Z., … Yu, K. (2023). Exploring Schema Generalizability of Text-to-SQL. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 1344–1360). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-acl.87