Is Attention Interpretation? A Quantitative Assessment on Sets

Jonathan Haab; Nicolas Deutschmann; María Rodríguez Martínez

Conference Proceedings

Communications in Computer and Information Science (2023) 1752 CCIS 303-321

DOI: 10.1007/978-3-031-23618-1_21

2 Citations

4 Readers


Abstract

The debate around the interpretability of attention mechanisms centers on whether attention scores can be used as a proxy for the relative amounts of signal carried by sub-components of data. We propose to study the interpretability of attention in the context of set machine learning, where each data point is composed of an unordered collection of instances with a global label. For classical multiple-instance-learning problems and simple extensions, there is a well-defined “importance” ground truth that can be leveraged to cast interpretation as a binary classification problem, which we can quantitatively evaluate. By building synthetic datasets over several data modalities, we perform a systematic assessment of attention-based interpretations. We find that attention distributions often do reflect the relative importance of individual instances, but that silent failures occur, in which a model achieves high classification performance while its attention patterns do not align with expectations. Based on these observations, we propose to use ensembling to minimize the risk of misleading attention-based explanations.
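As a rough illustration of the evaluation idea in the abstract, the sketch below treats per-instance attention weights as scores for a binary "is this a key instance?" classifier and compares single models against an ensemble. The choice of AUROC as the metric, the averaging rule used for ensembling, and all numbers are assumptions made for this example; the abstract does not specify them.

```python
from statistics import mean


def auroc(scores, labels):
    """Probability that a random key instance gets a higher score than a
    random non-key instance (ties count as 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one key and one non-key instance")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def ensemble_attention(attention_per_model):
    """Average attention weights instance-wise across models
    (one simple ensembling rule, assumed here)."""
    return [mean(column) for column in zip(*attention_per_model)]


# Hypothetical bag of five instances; instances 1 and 3 are the key instances.
labels = [0, 1, 0, 1, 0]

# Hypothetical attention weights from three independently trained models.
attention_per_model = [
    [0.05, 0.45, 0.05, 0.40, 0.05],
    [0.10, 0.40, 0.10, 0.35, 0.05],
    [0.40, 0.10, 0.30, 0.10, 0.10],  # attention lands on non-key instances
]

per_model = [auroc(a, labels) for a in attention_per_model]
ensemble = auroc(ensemble_attention(attention_per_model), labels)

print("per-model AUROC:", per_model)
print("ensemble AUROC:", ensemble)
```

In this toy setup the third model stands in for a silent failure: its attention ranks the key instances below most others, while the averaged attention still ranks them first.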

Cite

Haab, J., Deutschmann, N., & Rodríguez Martínez, M. (2023). Is Attention Interpretation? A Quantitative Assessment on Sets. In Communications in Computer and Information Science (Vol. 1752 CCIS, pp. 303–321). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-23618-1_21
