Detecting Code-Switching between Turkish-English Language Pair

19Citations
Citations of this article
91Readers
Mendeley users who have this article in their library.

Abstract

Code-switching (usage of different languages within a single conversation context in an alternative manner) is a highly increasing phenomenon in social media and colloquial usage which poses different challenges for natural language processing. This paper introduces the first study for the detection of Turkish-English code-switching and also a small test data collected from social media in order to smooth the way for further studies. The proposed system using character level n-grams and conditional random fields (CRFs) obtains 95.6% micro-averaged F1-score on the introduced test data set.

Cite

CITATION STYLE

APA

Yirmibeşoğlu, Z., & Eryiğit, G. (2018). Detecting Code-Switching between Turkish-English Language Pair. In 4th Workshop on Noisy User-Generated Text, W-NUT 2018 - Proceedings of the Workshop (pp. 110–115). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w18-6115

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free