A Novel Knowledge-augmented Model Customization Approach for Arabic Offensive Language Detection

Fatemah Husain

Journal ArticleOPEN ACCESS

A Novel Knowledge-augmented Model Customization Approach for Arabic Offensive Language Detection

Husain F

ACM Transactions on Asian and Low-Resource Language Information Processing (2023) 22(12)

DOI: 10.1145/3634702

0Citations

12Readers

Abstract

Multiple attempts to develop systems for detecting online Arabic offensive language have been explored in previous studies. However, most of these attempts do not consider the variation of Arabic dialects, cultures, and offensive phrases. In contrast, this study aims to extract knowledge from multiple offensive language datasets to build a cross-dialect and culture knowledge-based repository. This knowledge-based repository is utilized to develop novel system architecture based on customizing the AraBERT model in a unique method to preserve dialectal knowledge and offensive cultural knowledge within the contextual word embedding of BERT architecture. Performance evaluation procedures consist of statistical evaluation metrics and a behavioral checklist. Results report more effective predictions by the customized model than the uncustomized one, particularly for offensive text. The customization process allows the model to gain more knowledge of informal text in general, and a better understanding of dialectal Arabic.

Author supplied keywords

Cite

CITATION STYLE

APA

Husain, F. (2023). A Novel Knowledge-augmented Model Customization Approach for Arabic Offensive Language Detection. ACM Transactions on Asian and Low-Resource Language Information Processing, 22(12). https://doi.org/10.1145/3634702

A Novel Knowledge-augmented Model Customization Approach for Arabic Offensive Language Detection

Abstract

Author supplied keywords

Cite

Register to see more suggestions