Smatgrisene at SemEval-2020 Task 12: Offense detection by AI - with a pinch of real I

1Citations
Citations of this article
60Readers
Mendeley users who have this article in their library.

Abstract

This paper discusses how ML based classifiers can be enhanced disproportionately by adding small amounts of qualitative linguistic knowledge. As an example we present the Danish classifier Smatgrisene, our contribution to the recent OffensEval Challenge 2020. The classifier was trained on 3000 social media posts annotated for offensiveness, supplemented by rules extracted from the reference work on Danish offensive language (Rathje 2014b). Smatgrisene did surprisingly well in the competition in spite of its extremely simple design, showing an interesting trade-off between technological muscle and linguistic intelligence. Finally, we comment on the perspectives in combining qualitative and quantitative methods for NLP..

Cite

CITATION STYLE

APA

Henrichsen, P. J., & Rathje, M. (2020). Smatgrisene at SemEval-2020 Task 12: Offense detection by AI - with a pinch of real I. In 14th International Workshops on Semantic Evaluation, SemEval 2020 - co-located 28th International Conference on Computational Linguistics, COLING 2020, Proceedings (pp. 2140–2145). International Committee for Computational Linguistics. https://doi.org/10.18653/v1/2020.semeval-1.284

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free