Classical Out-of-Distribution Detection Methods Benchmark in Text Classification Tasks

5Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.

Abstract

State-of-the-art models can perform well in controlled environments, but they often struggle when presented with out-of-distribution (OOD) examples, making OOD detection a critical component of NLP systems. In this paper, we focus on highlighting the limitations of existing approaches to OOD detection in NLP. Specifically, we evaluated eight OOD detection methods that are easily integrable into existing NLP systems and require no additional OOD data or model modifications. One of our contributions is providing a well-structured research environment that allows for full reproducibility of the results. Additionally, our analysis shows that existing OOD detection methods for NLP tasks are not yet sufficiently sensitive to capture all samples characterized by various types of distributional shifts. Particularly challenging testing scenarios arise in cases of background shift and randomly shuffled word order within in domain texts. This highlights the need for future work to develop more effective OOD detection approaches for the NLP problems, and our work provides a well-defined foundation for further research in this area.

References Powered by Scopus

Understanding bag-of-words model: A statistical framework

1059Citations
N/AReaders
Get full text

Improved Adam Optimizer for Deep Neural Networks

946Citations
N/AReaders
Get full text

A survey of human-in-the-loop for machine learning

358Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Investigation of out-of-distribution detection across various models and training methodologies

1Citations
N/AReaders
Get full text

Beta Distribution Approach for Outlier Exposure in Multi-class Text Classification

0Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Baran, M., Baran, J., Wójcik, M., Zięba, M., & Gonczarek, A. (2023). Classical Out-of-Distribution Detection Methods Benchmark in Text Classification Tasks. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 4, pp. 119–129). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.acl-srw.20

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 2

50%

Lecturer / Post doc 1

25%

Researcher 1

25%

Readers' Discipline

Tooltip

Computer Science 7

88%

Medicine and Dentistry 1

13%

Save time finding and organizing research with Mendeley

Sign up for free