Aggressive language in an online hacking forum

Andrew Caines; Sergio Pastrana; Alice Hutchings; Paula Buttery

Conference ProceedingsOPEN ACCESS

Aggressive language in an online hacking forum

2nd Workshop on Abusive Language Online - Proceedings of the Workshop, co-located with EMNLP 2018 (2018) 66-74

DOI: 10.18653/v1/w18-5109

5Citations

84Readers

Abstract

We probe the heterogeneity in levels of abusive language in different sections of the Internet, using an annotated corpus of Wikipedia page edit comments to train a binary classifier for abuse detection. Our test data come from the CrimeBB Corpus of hacking-related forum posts and we find that (a) forum interactions are rarely abusive, (b) the abusive language which does exist tends to be relatively mild compared to that found in the Wikipedia comments domain, and tends to involve aggressive posturing rather than hate speech or threats of violence. We observe that the purpose of conversations in online forums tend to be more constructive and informative than those in Wikipedia page edit comments which are geared more towards adversarial interactions, and that this may explain the lower levels of abuse found in our forum data than in Wikipedia comments. Further work remains to be done to compare these results with other inter-domain classification experiments, and to understand the impact of aggressive language in forum conversations.

Cite

CITATION STYLE

APA

Caines, A., Pastrana, S., Hutchings, A., & Buttery, P. (2018). Aggressive language in an online hacking forum. In 2nd Workshop on Abusive Language Online - Proceedings of the Workshop, co-located with EMNLP 2018 (pp. 66–74). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w18-5109

Aggressive language in an online hacking forum

Abstract

Cite

Register to see more suggestions