Semantics Altering Modifications for Evaluating Comprehension in Machine Reading

7Citations
Citations of this article
21Readers
Mendeley users who have this article in their library.

Abstract

Advances in NLP have yielded impressive results for the task of machine reading comprehension (MRC), with approaches having been reported to achieve performance comparable to that of humans. In this paper, we investigate whether stateof- the-art MRC models are able to correctly process Semantics Altering Modifications (SAM): linguistically-motivated phenomena that alter the semantics of a sentence while preserving most of its lexical surface form. We present a method to automatically generate and align challenge sets featuring original and altered examples. We further propose a novel evaluation methodology to correctly assess the capability of MRC systems to process these examples independent of the data they were optimised on, by discounting for effects introduced by domain shift. In a large-scale empirical study, we apply the methodology in order to evaluate extractive MRC models with regard to their capability to correctly process SAM-enriched data. We comprehensively cover 12 different state-of-the-art neural architecture configurations and four training datasets and find that - despite their well-known remarkable performance - optimised models consistently struggle to correctly process semantically altered data.

References Powered by Scopus

SQuad: 100,000+ questions for machine comprehension of text

4028Citations
2234Readers

Adversarial examples for evaluating reading comprehension systems

902Citations
926Readers
Get full text

Cited by Powered by Scopus

Measure and Improve Robustness in NLP Models: A Survey

65Citations
119Readers

This article is free to access.

Chain-of-Questions Training with Latent Answers for Robust Multistep Question Answering

2Citations
16Readers

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Schlegel, V., Nenadic, G., & Batista-Navarro, R. (2021). Semantics Altering Modifications for Evaluating Comprehension in Machine Reading. In 35th AAAI Conference on Artificial Intelligence, AAAI 2021 (Vol. 15, pp. 13762–13770). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v35i15.17622

Readers over time

‘20‘21‘22‘230481216

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 8

73%

Researcher 3

27%

Readers' Discipline

Tooltip

Computer Science 10

83%

Physics and Astronomy 1

8%

Business, Management and Accounting 1

8%

Save time finding and organizing research with Mendeley

Sign up for free
0