Verifying text summaries of relational data sets

16Citations
Citations of this article
36Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We present a novel natural language query interface, the AggChecker, aimed at text summaries of relational data sets. The tool focuses on natural language claims that translate into an SQL query and a claimed query result. Similar in spirit to a spell checker, the AggChecker marks up text passages that seem to be inconsistent with the actual data. At the heart of the system is a probabilistic model that reasons about the input document in a holistic fashion. Based on claim keywords and the document structure, it maps each text claim to a probability distribution over associated query translations. By efficiently executing tens to hundreds of thousands of candidate translations for a typical input document, the system maps text claims to correctness probabilities. This process becomes practical via a specialized processing back-end, avoiding redundant work via query merging and result caching. Verification is an interactive process in which users are shown tentative results, enabling them to take corrective actions if necessary. We tested our system on 53 publicly available articles containing 392 claims. Our tool revealed erroneous claims in roughly a third of test cases. Also, AggChecker compares favorably against several automated and semi-automated fact checking baselines.

Cite

CITATION STYLE

APA

Jo, S., Trummer, I., Yu, W., Wang, X., Yu, C., Liu, D., & Mehta, N. (2019). Verifying text summaries of relational data sets. In Proceedings of the ACM SIGMOD International Conference on Management of Data (pp. 299–316). Association for Computing Machinery. https://doi.org/10.1145/3299869.3300074

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free