Natural language explanations have the potential to provide rich information that in principle guides model reasoning. Yet, recent work by Lampinen et al. (2022) has shown limited utility of natural language explanations in improving classification. To effectively learn from explanations, we present FLamE, a two-stage few-shot learning framework that first generates explanations using GPT-3, and then fine-tunes a smaller model (e.g., RoBERTa) with generated explanations. Our experiments on natural language inference demonstrate effectiveness over strong baselines, increasing accuracy by 17.6% over GPT-3 Babbage and 5.7% over GPT-3 Davinci in e-SNLI. Despite improving classification performance, human evaluation surprisingly reveals that the majority of generated explanations does not adequately justify classification decisions. Additional analyses point to the important role of label-specific cues (e.g., “not know” for the neutral label) in generated explanations.
CITATION STYLE
Zhou, Y., Zhang, Y., & Tan, C. (2023). FLamE: Few-shot Learning from Natural Language Explanations. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1, pp. 6743–6763). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.acl-long.372
Mendeley helps you to discover research relevant for your work.