Issues with Entailment-based Zero-shot Text Classification

Abstract

The general format of natural language inference (NLI) makes it tempting to use for zero-shot text classification: each target label is cast as a hypothesis sentence, and the model verifies whether the input entails it, with the aim of generic classification applicable to any specified label space. In this opinion piece, we point out a few overlooked issues that have yet to be discussed in this line of work. We observe large variance across classification datasets among standard BERT-based NLI models, and surprisingly find that pre-trained BERT without any fine-tuning can yield competitive performance against BERT fine-tuned for NLI. Concerned that these models rely heavily on spurious lexical patterns for prediction, we also experiment with preliminary approaches for more robust NLI, but the results are generally negative. Our observations reveal implicit but challenging difficulties in entailment-based zero-shot text classification.
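To make the setup concrete, below is a minimal sketch of entailment-based zero-shot classification using the Hugging Face transformers zero-shot pipeline. The BART-MNLI checkpoint and the hypothesis template are illustrative assumptions for this sketch, not the BERT-based NLI models the paper evaluates; the mechanism, however, is the same casting of labels into hypotheses that the abstract describes.

```python
# Minimal sketch of entailment-based zero-shot classification.
# Assumption: we use the Hugging Face zero-shot pipeline with a
# BART-MNLI checkpoint as a stand-in for the paper's BERT-based
# NLI models.
from transformers import pipeline

classifier = pipeline("zero-shot-classification",
                      model="facebook/bart-large-mnli")

text = "The team clinched the championship with a last-minute goal."
labels = ["sports", "politics", "technology"]

# Each candidate label is cast into a hypothesis sentence via the
# template; the NLI model then scores whether the input text
# entails each hypothesis, and labels are ranked by that score.
result = classifier(text,
                    candidate_labels=labels,
                    hypothesis_template="This text is about {}.")
print(result["labels"][0])  # highest-scoring label, e.g. "sports"
```

Because the label space enters only through the hypothesis template, the same model can be pointed at any set of labels without retraining, which is exactly the genericity that makes the approach attractive and, per the paper, also exposes it to spurious lexical cues in the hypotheses.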

Citation (APA)

Ma, T., Yao, J.-G., Lin, C.-Y., & Zhao, T. (2021). Issues with entailment-based zero-shot text classification. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers) (pp. 786–796). Association for Computational Linguistics. https://doi.org/10.18653/v1/2021.acl-short.99
