We generate natural language explanations for a fine-grained visual recognition task. Our explanations satisfy two criteria. First, they are class-discriminative: they mention the attributes in an image that are important for identifying a class. Second, they are image-relevant: they reflect the actual content of the image. Our system, composed of an explanation sampler and a phrase-critic model, generates explanations that are both class-discriminative and image-relevant. In addition, we demonstrate that our explanations can help humans decide whether to accept or reject an AI decision.
CITATION STYLE
Hendricks, L. A., Rohrbach, A., Schiele, B., Darrell, T., & Akata, Z. (2021, December 1). Generating visual explanations with natural language. Applied AI Letters. John Wiley and Sons Inc. https://doi.org/10.1002/ail2.55