Abstract
The general format of natural language inference (NLI) makes it tempting to use for zero-shot text classification: each target label is cast into a hypothesis sentence, and an NLI model verifies whether the hypothesis is entailed by the input, yielding a generic classifier applicable to any specified label space. In this opinion piece, we point out a few overlooked issues that have yet to be discussed in this line of work. We observe large variance across classification datasets among standard BERT-based NLI models and, surprisingly, find that pre-trained BERT without any fine-tuning can perform competitively against BERT fine-tuned for NLI. Concerned that these models rely heavily on spurious lexical patterns for prediction, we also experiment with preliminary approaches toward more robust NLI, but the results are generally negative. Our observations reveal implicit yet challenging difficulties in entailment-based zero-shot text classification.
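To make the setup concrete, here is a minimal sketch of entailment-based zero-shot classification using the Hugging Face transformers zero-shot pipeline. The model choice (facebook/bart-large-mnli), the example text, and the hypothesis template are illustrative assumptions, not the exact configuration studied in the paper (which evaluates BERT-based NLI models).

```python
from transformers import pipeline

# Entailment-based zero-shot classification: each candidate label is
# cast into a hypothesis sentence via the template, and the NLI model
# scores whether the input (treated as the premise) entails it.
classifier = pipeline(
    "zero-shot-classification",
    model="facebook/bart-large-mnli",  # assumption: any NLI model can be swapped in
)

text = "The team scored in the final minute to win the championship."
labels = ["sports", "politics", "technology"]  # any label space works

result = classifier(
    text,
    candidate_labels=labels,
    hypothesis_template="This text is about {}.",  # label -> hypothesis sentence
)
print(result["labels"][0], result["scores"][0])  # top-ranked label and its score
```

Because the label space is supplied only at inference time through the hypothesis template, no task-specific fine-tuning is required; this is precisely the genericity that motivates the approach, and also why the paper's observed reliance on spurious lexical patterns is a concern.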
Citation
Ma, T., Yao, J. G., Lin, C. Y., & Zhao, T. (2021). Issues with Entailment-based Zero-shot Text Classification. In ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference (Vol. 2, pp. 786–796). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.acl-short.99