Sanity checks for saliency metrics

Richard Tomsett; Dan Harborne; Supriyo Chakraborty; Prudhvi Gurram; Alun Preece

Conference ProceedingsOPEN ACCESS

Sanity checks for saliency metrics

AAAI 2020 - 34th AAAI Conference on Artificial Intelligence (2020) 6021-6029

DOI: 10.1609/aaai.v34i04.6064

82Citations

102Readers

Abstract

Saliency maps are a popular approach to creating post-hoc explanations of image classifier outputs. These methods produce estimates of the relevance of each pixel to the classification output score, which can be displayed as a saliency map that highlights important pixels. Despite a proliferation of such methods, little effort has been made to quantify how good these saliency maps are at capturing the true relevance of the pixels to the classifier output (i.e. their “fidelity”). We therefore investigate existing metrics for evaluating the fidelity of saliency methods (i.e. saliency metrics). We find that there is little consistency in the literature in how such metrics are calculated, and show that such inconsistencies can have a significant effect on the measured fidelity. Further, we apply measures of reliability developed in the psychometric testing literature to assess the consistency of saliency metrics when applied to individual saliency maps. Our results show that saliency metrics can be statistically unreliable and inconsistent, indicating that comparative rankings between saliency methods generated using such metrics can be untrustworthy.

Cite

CITATION STYLE

APA

Tomsett, R., Harborne, D., Chakraborty, S., Gurram, P., & Preece, A. (2020). Sanity checks for saliency metrics. In AAAI 2020 - 34th AAAI Conference on Artificial Intelligence (pp. 6021–6029). AAAI press. https://doi.org/10.1609/aaai.v34i04.6064

Sanity checks for saliency metrics

Abstract

Cite

Register to see more suggestions