UVA Wahoos at SemEval-2019 task 6: Hate speech identification using ensemble machine learning

10Citations
Citations of this article
75Readers
Mendeley users who have this article in their library.

Abstract

With the growth in the usage of social media, it has become increasingly common for people to hide behind a mask and abuse others. We have attempted to detect such tweets and comments that are malicious in intent, which either targets an individual or a group. Our best classifier for identifying offensive tweets for SubTask A (Classifying offensive vs. non-offensive) has an accuracy of 83.14% and a f1-score of 0.7565 on the actual test data. For SubTask B, to identify if an offensive tweet is targeted (If targeted towards an individual or a group), the classifier performs with an accuracy of 89.17% and f1-score of 0.5885. The paper talks about how we generated linguistic and semantic features to build an ensemble machine learning model. By training with more extracts from different sources (Face-book, and more tweets), the paper shows how the accuracy changes with additional training data.

Cite

CITATION STYLE

APA

Ramakrishnan, M., Zadrozny, W., & Tabari, N. (2019). UVA Wahoos at SemEval-2019 task 6: Hate speech identification using ensemble machine learning. In NAACL HLT 2019 - International Workshop on Semantic Evaluation, SemEval 2019, Proceedings of the 13th Workshop (pp. 806–811). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/s19-2141

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free