Interpreting Text Classifiers by Learning Context-sensitive Influence of Words

Sawan Kumar; Kalpit Dixit; Kashif Shah

Conference ProceedingsOPEN ACCESS

Interpreting Text Classifiers by Learning Context-sensitive Influence of Words

TrustNLP 2021 - 1st Workshop on Trustworthy Natural Language Processing, Proceedings of the Workshop (2021) 55-67

DOI: 10.18653/v1/2021.trustnlp-1.7

2Citations

47Readers

Abstract

Many existing approaches for interpreting text classification models focus on providing importance scores for parts of the input text, such as words, but without a way to test or improve the interpretation method itself. This has the effect of compounding the problem of understanding or building trust in the model, with the interpretation method itself adding to the opacity of the model. Further, importance scores on individual examples are usually not enough to provide a sufficient picture of model behavior. To address these concerns, we propose MOXIE (MOdeling conteXt-sensitive InfluencE of words) with an aim to enable a richer interface for a user to interact with the model being interpreted and to produce testable predictions. In particular, we aim to make predictions for importance scores, counterfactuals and learned biases with MOXIE. In addition, with a global learning objective, MOXIE provides a clear path for testing and improving itself. We evaluate the reliability and efficiency of MOXIE on the task of sentiment analysis.

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Kumar, S., Dixit, K., & Shah, K. (2021). Interpreting Text Classifiers by Learning Context-sensitive Influence of Words. In TrustNLP 2021 - 1st Workshop on Trustworthy Natural Language Processing, Proceedings of the Workshop (pp. 55–67). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.trustnlp-1.7

Readers over time

Readers' Seniority

PhD / Post grad / Masters / Doc 8

53%

Researcher 4

27%

Lecturer / Post doc 2

13%

Professor / Associate Prof. 1

Readers' Discipline

Computer Science 14

70%

Linguistics 4

20%

Neuroscience 1

Social Sciences 1

Interpreting Text Classifiers by Learning Context-sensitive Influence of Words

Abstract

References Powered by Scopus

"Why should i trust you?" Explaining the predictions of any classifier

Questioning the AI: Informing Design Practices for Explainable AI User Experiences

Interpretation of neural networks is fragile

Cited by Powered by Scopus

RLAS-BIABC: A Reinforcement Learning-Based Answer Selection Using the BERT Model Boosted by an Improved ABC Algorithm

A survey on extremism analysis using natural language processing: definitions, literature review, trends and challenges

Register to see more suggestions

Cite

Readers over time

Readers' Seniority

Readers' Discipline