Improving the robustness of question answering systems to question paraphrasing

86 citations · 196 Mendeley readers

Abstract

Despite the advancement of question answering (QA) systems and rapid improvements on held-out test sets, their generalizability is a topic of concern. We explore the robustness of QA models to question paraphrasing by creating two test sets consisting of paraphrased SQuAD questions. Paraphrased questions from the first test set are very similar to the original questions and are designed to test QA models' over-sensitivity, while questions from the second test set are paraphrased using context words near an incorrect answer candidate in an attempt to confuse QA models. We show that both paraphrased test sets lead to a significant decrease in the performance of multiple state-of-the-art QA models. Using a neural paraphrasing model trained to generate multiple paraphrased questions for a given source question and a set of paraphrase suggestions, we propose a data augmentation approach that requires no human intervention to re-train the models for improved robustness to question paraphrasing.
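The augmentation step described above can be pictured with a minimal Python sketch. This is not the authors' code: the paraphrase function below is a hypothetical stand-in for their neural paraphrasing model, and the standard SQuAD v1.1 JSON layout (data → paragraphs → qas) is assumed. The sketch adds paraphrased copies of each training question while keeping the original context and answer spans unchanged.

import json

def paraphrase(question, num_variants=3):
    # Hypothetical stand-in for the neural paraphrasing model described in the
    # abstract; a real system would return num_variants distinct rewordings.
    return [question] * num_variants

def augment_squad(squad_path, out_path, num_variants=3):
    # Append paraphrased copies of every question to a SQuAD-format training
    # file, reusing the original answer annotations for each paraphrase.
    with open(squad_path) as f:
        data = json.load(f)
    for article in data["data"]:
        for paragraph in article["paragraphs"]:
            new_qas = []
            for qa in paragraph["qas"]:
                for i, para_q in enumerate(paraphrase(qa["question"], num_variants)):
                    new_qa = dict(qa)
                    new_qa["question"] = para_q
                    new_qa["id"] = f"{qa['id']}_para{i}"
                    new_qas.append(new_qa)
            paragraph["qas"].extend(new_qas)
    with open(out_path, "w") as f:
        json.dump(data, f)

if __name__ == "__main__":
    augment_squad("train-v1.1.json", "train-augmented.json")

A retrained QA model would then be evaluated on the paraphrased test sets to measure the gain in robustness; the file names here are illustrative only.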

Citation (APA)

Gan, W. C., & Ng, H. T. (2019). Improving the robustness of question answering systems to question paraphrasing. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (pp. 6065–6075). Association for Computational Linguistics. https://doi.org/10.18653/v1/p19-1610
