AraMUS: Pushing the Limits of Data and Model Scale for Arabic Natural Language Processing

0Citations
Citations of this article
18Readers
Mendeley users who have this article in their library.

Abstract

Developing monolingual large Pre-trained Language Models (PLMs) is shown to be very successful in handling different tasks in Natural Language Processing (NLP). In this work, we present AraMUS, the largest Arabic PLM with 11B parameters trained on 529GB of high-quality Arabic textual data. AraMUS achieves state-of-the-art performances on a diverse set of Arabic classification and generative tasks. Moreover, AraMUS shows impressive few-shot learning abilities compared with the best existing Arabic PLMs.

Cite

CITATION STYLE

APA

Alghamdi, A., Duan, X., Jiang, W., Wang, Z., Wu, Y., Xia, Q., … Ghaddar, A. (2023). AraMUS: Pushing the Limits of Data and Model Scale for Arabic Natural Language Processing. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 2883–2894). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-acl.181

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free