Abstract
State-of-the-art approaches to identifying offensive language online make use of large pre-trained transformer models. However, the inference time, disk, and memory requirements of these transformer models present challenges for their wide usage in the real world. Even the distilled transformer models remain prohibitively large for many usage scenarios. To cope with these challenges, in this paper, we propose transferring knowledge from transformer models to much smaller neural models to make predictions at the token- and at the post-level. We show that this approach leads to lightweight offensive language identification models that perform on par with large transformers but with 100 times fewer parameters and much less memory usage.
Cite
CITATION STYLE
Ranasinghe, T., & Zampieri, M. (2023). Teacher and Student Models of Offensive Language in Social Media. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 3910–3922). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-acl.241
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.