This paper describes the submissions of our team, HAD-Tübingen, for the SemEval 2019 - Task 6: “OffensEval: Identifying and Categorizing Offensive Language in Social Media”. We participated in all the three sub-tasks: Sub-task A - “Offensive language identification”, sub-task B - “Automatic categorization of offense types” and sub-task C - “Offense target identification”. As a baseline model we used a Long short-term memory recurrent neural network (LSTM) to identify and categorize offensive tweets. For all the tasks we experimented with external databases in a postprocessing step to enhance the results made by our model. The best macro-average F1 scores obtained for the sub-tasks A, B and C are 0.73, 0.52, and 0.37, respectively.
CITATION STYLE
Bansal, H., Nagel, D., & Soloveva, A. (2019). HAD-Tübingen at SemEval-2019 task 6: Deep learning analysis of offensive language on twitter: Identification and categorization. In NAACL HLT 2019 - International Workshop on Semantic Evaluation, SemEval 2019, Proceedings of the 13th Workshop (pp. 622–627). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/s19-2111
Mendeley helps you to discover research relevant for your work.