UrduThreat@ FIRE2021: Shared Track on Abusive Threat Identification in Urdu

Maaz Amjad; Alisa Zhila; Grigori Sidorov; Andrey Labunets; Sabur Butt; Hamza Imam Amjad; Oxana Vitman; Alexander Gelbukh

Conference ProceedingsOPEN ACCESS

UrduThreat@ FIRE2021: Shared Track on Abusive Threat Identification in Urdu

ACM International Conference Proceeding Series (2021) 9-11

DOI: 10.1145/3503162.3505241

23Citations

5Readers

Get full text

Abstract

With the growth of spread and importance of social media platforms, the effect of their misuse became more and more impactful. This shared task address the task of abusive and threatening language detection in Urdu language that has more than 230 million speakers worldwide. We presented two datasets: (i) Abusive and Non-Abusive language, (ii) Threatening and Non-Threatening language. The abusive dataset contains 1,187 tweets categorized as Abusive and 1,213 as Non-Abusive and the threatening dataset contains 4,929 tweets categorized as Non-Threatening and 1,071 as Threatening. In this shared task, 21 teams registered for participation from six countries (India, Pakistan, China, Malaysia, United Arab Emirates, Taiwan), 10 teams submitted their runs for Subtask A - Abusive Language Detection, 9 teams submitted their runs for Subtask B - Threatening Language detection, and seven teams submitted their technical reports. We provided one baseline system for Subtask A and three baseline systems for Subtask B. The best performing system achieved an F-score value of 0.88 for Subtask A and 0.545 for Subtask B. For both subtasks, m-Bert based transformer models showed the best performance.

Author supplied keywords

Cite

CITATION STYLE

APA

Amjad, M., Zhila, A., Sidorov, G., Labunets, A., Butt, S., Amjad, H. I., … Gelbukh, A. (2021). UrduThreat@ FIRE2021: Shared Track on Abusive Threat Identification in Urdu. In ACM International Conference Proceeding Series (pp. 9–11). Association for Computing Machinery. https://doi.org/10.1145/3503162.3505241

UrduThreat@ FIRE2021: Shared Track on Abusive Threat Identification in Urdu

Abstract

Author supplied keywords

Cite

Register to see more suggestions