Improving understandability of feature contributions in model-agnostic explainable AI tools

Sophia Hadash; Martijn C. Willemsen; Chris Snijders; Wijnand A. Ijsselsteijn

Conference Proceedings, Open Access

Conference on Human Factors in Computing Systems - Proceedings (2022)

DOI: 10.1145/3491102.3517650

18 Citations

43 Readers

Abstract

Model-agnostic explainable AI tools explain their predictions by means of 'local' feature contributions. We empirically investigate two potential improvements over current approaches. The first is to always present feature contributions in terms of their contribution to the outcome that the user perceives as positive ("positive framing"). The second is to add "semantic labeling" that explains the directionality of each feature contribution ("this feature leads to +5% eligibility"), reducing additional cognitive processing steps. In a user study, participants evaluated the understandability of explanations under different framing and labeling conditions for loan applications and music recommendations. We found that positive framing improves understandability even when the prediction is negative. Additionally, adding semantic labels eliminates any framing effects on understandability, with positive labels outperforming negative labels. We implemented our suggestions in a package, ArgueView [11].
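As a rough illustration of the two ideas in the abstract, the following minimal sketch re-expresses signed feature contributions relative to the positively perceived outcome and then attaches a semantic label to each one. It is not the ArgueView API: the names `Contribution`, `positive_frame`, and `semantic_label` are hypothetical, and the sketch assumes a binary task in which a contribution of +x toward one class equals -x toward the other.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Contribution:
    """Hypothetical container: a feature and its signed contribution,
    expressed as a change in the probability of some outcome."""
    feature: str
    value: float


def positive_frame(contributions: List[Contribution],
                   predicted_is_positive: bool) -> List[Contribution]:
    """Re-express contributions toward the predicted class as contributions
    toward the positively perceived outcome (assumed binary task, so the
    contribution to the other class is the negation)."""
    sign = 1.0 if predicted_is_positive else -1.0
    return [Contribution(c.feature, sign * c.value) for c in contributions]


def semantic_label(c: Contribution, outcome_name: str) -> str:
    """Spell out the direction of a contribution in words and sign,
    in the style of the abstract's example."""
    return f"{c.feature} leads to {c.value * 100:+.0f}% {outcome_name}"


# Illustrative input (made-up numbers): the model predicts "rejected",
# and contributions are given toward that negative prediction.
toward_rejection = [Contribution("income", 0.05), Contribution("age", -0.02)]

framed = positive_frame(toward_rejection, predicted_is_positive=False)
labels = [semantic_label(c, "eligibility") for c in framed]
for line in labels:
    print(line)
```

Keeping the framing step separate from the labeling step mirrors the way the study treats them as two independent manipulations, so either one can be applied without the other.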

Cite (APA)

Hadash, S., Willemsen, M. C., Snijders, C., & Ijsselsteijn, W. A. (2022). Improving understandability of feature contributions in model-agnostic explainable AI tools. In Conference on Human Factors in Computing Systems - Proceedings. Association for Computing Machinery. https://doi.org/10.1145/3491102.3517650
