What if This Modified That? Syntactic Interventions via Counterfactual Embeddings

17Citations
Citations of this article
58Readers
Mendeley users who have this article in their library.

Abstract

Neural language models exhibit impressive performance on a variety of tasks, but their internal reasoning may be difficult to understand. Prior art aims to uncover meaningful properties within model representations via probes, but it is unclear how faithfully such probes portray information that the models actually use. To overcome such limitations, we propose a technique, inspired by causal analysis, for generating counterfactual embeddings within models. In experiments testing our technique, we produce evidence that suggests some BERT-based models use a tree-distance-like representation of syntax in downstream prediction tasks.

Cite

CITATION STYLE

APA

Tucker, M., Qian, P., & Levy, R. P. (2021). What if This Modified That? Syntactic Interventions via Counterfactual Embeddings. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (pp. 862–875). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.findings-acl.76

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free