Self-attentive, multi-context one-class classification for unsupervised anomaly detection on text

Lukas Ruff; Yury Zemlyanskiy; Robert Vandermeulen; Thomas Schnake; Marius Kloft

Conference ProceedingsOPEN ACCESS

Self-attentive, multi-context one-class classification for unsupervised anomaly detection on text

ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (2020) 4061-4071

DOI: 10.18653/v1/p19-1398

45Citations

147Readers

Abstract

There exist few text-specific methods for unsupervised anomaly detection, and for those that do exist, none utilize pre-trained models for distributed vector representations of words. In this paper we introduce a new anomaly detection method-Context Vector Data Description (CVDD)-which builds upon word embedding models to learn multiple sentence representations that capture multiple semantic contexts via the self-attention mechanism. Modeling multiple contexts enables us to perform contextual anomaly detection of sentences and phrases with respect to the multiple themes and concepts present in an unlabeled text corpus. These contexts in combination with the self-attention weights make our method highly interpretable. We demonstrate the effectiveness of CVDD quantitatively as well as qualitatively on the well-known Reuters, 20 Newsgroups, and IMDB Movie Reviews datasets.

Cite

CITATION STYLE

APA

Ruff, L., Zemlyanskiy, Y., Vandermeulen, R., Schnake, T., & Kloft, M. (2020). Self-attentive, multi-context one-class classification for unsupervised anomaly detection on text. In ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (pp. 4061–4071). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/p19-1398

Self-attentive, multi-context one-class classification for unsupervised anomaly detection on text

Abstract

Cite

Register to see more suggestions