Code-switching language modeling with bilingual word embeddings: A case study for egyptian arabic-english

Injy Hamed; Moritz Zhu; Mohamed Elmahdy; Slim Abdennadher; Ngoc Thang Vu

Conference Proceedings

Code-switching language modeling with bilingual word embeddings: A case study for egyptian arabic-english

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11658 LNAI 160-170

DOI: 10.1007/978-3-030-26061-3_17

3Citations

11Readers

Get full text

Abstract

Code-switching (CS) is a widespread phenomenon among bilingual and multilingual societies. The lack of CS resources hinders the performance of many NLP tasks. In this work, we explore the potential use of bilingual word embeddings for code-switching (CS) language modeling (LM) in the low resource Egyptian Arabic-English language. We evaluate different state-of-the-art bilingual word embeddings approaches that require cross-lingual resources at different levels and propose an innovative but simple approach that jointly learns bilingual word representations without the use of any parallel data, relying only on monolingual and a small amount of CS data. While all representations improve CS LM, ours performs the best and improves perplexity 33.5% relative over the baseline.

Cite

CITATION STYLE

APA

Hamed, I., Zhu, M., Elmahdy, M., Abdennadher, S., & Vu, N. T. (2019). Code-switching language modeling with bilingual word embeddings: A case study for egyptian arabic-english. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11658 LNAI, pp. 160–170). Springer Verlag. https://doi.org/10.1007/978-3-030-26061-3_17

Code-switching language modeling with bilingual word embeddings: A case study for egyptian arabic-english

Abstract

Cite

Register to see more suggestions