A Novel Knowledge-augmented Model Customization Approach for Arabic Offensive Language Detection

0Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.

Abstract

Multiple attempts to develop systems for detecting online Arabic offensive language have been explored in previous studies. However, most of these attempts do not consider the variation of Arabic dialects, cultures, and offensive phrases. In contrast, this study aims to extract knowledge from multiple offensive language datasets to build a cross-dialect and culture knowledge-based repository. This knowledge-based repository is utilized to develop novel system architecture based on customizing the AraBERT model in a unique method to preserve dialectal knowledge and offensive cultural knowledge within the contextual word embedding of BERT architecture. Performance evaluation procedures consist of statistical evaluation metrics and a behavioral checklist. Results report more effective predictions by the customized model than the uncustomized one, particularly for offensive text. The customization process allows the model to gain more knowledge of informal text in general, and a better understanding of dialectal Arabic.

Cite

CITATION STYLE

APA

Husain, F. (2023). A Novel Knowledge-augmented Model Customization Approach for Arabic Offensive Language Detection. ACM Transactions on Asian and Low-Resource Language Information Processing, 22(12). https://doi.org/10.1145/3634702

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free