On the Interaction between Annotation Quality and Classifier Performance in Abusive Language Detection

3Citations
Citations of this article
44Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Abusive language detection has become an important tool for the cultivation of safe online platforms. We investigate the interaction of annotation quality and classifier performance. We use a new, fine-grained annotation scheme that allows us to distinguish between abusive language and colloquial uses of profanity that are not meant to harm. Our results show a tendency of crowd workers to overuse the abusive class, which creates an unrealistic class balance and affects classification accuracy. We also investigate different methods of distinguishing between explicit and implicit abuse and show lexicon-based approaches either over- or under-estimate the proportion of explicit abuse in data sets.

Cite

CITATION STYLE

APA

Long, H. L., O’Neil, A., & Kbler, S. (2021). On the Interaction between Annotation Quality and Classifier Performance in Abusive Language Detection. In International Conference Recent Advances in Natural Language Processing, RANLP (pp. 868–875). Incoma Ltd. https://doi.org/10.26615/978-954-452-072-4_099

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free