Detecting Code-Switching between Turkish-English Language Pair

Zeynep Yirmibeşoğlu; Gülşen Eryiğit

4th Workshop on Noisy User-Generated Text, W-NUT 2018 - Proceedings of the Workshop (2018) 110-115

DOI: 10.18653/v1/w18-6115

Abstract

Code-switching, the alternating use of different languages within a single conversation, is an increasingly common phenomenon in social media and colloquial usage, and it poses distinct challenges for natural language processing. This paper presents the first study on detecting Turkish-English code-switching and introduces a small test data set collected from social media to pave the way for further studies. The proposed system, which uses character-level n-grams and conditional random fields (CRFs), obtains a 95.6% micro-averaged F1-score on the introduced test data set.
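As an illustration of what character-level n-gram features for token-level language labeling might look like, the following is a minimal sketch and not the authors' implementation. The function names (`char_ngrams`, `token_features`, `sentence_features`), the n-gram range of 1 to 3, the boundary markers, and the neighbouring-token context features are all assumptions made for this example; the abstract does not specify them. The resulting per-token feature dictionaries are in the general shape that CRF toolkits commonly accept as input.

```python
from typing import Dict, List


def char_ngrams(token: str, n_min: int = 1, n_max: int = 3) -> List[str]:
    """Return all character n-grams of a token for n in [n_min, n_max].

    The token is lowercased and wrapped in '<' and '>' so that prefixes
    and suffixes are distinguishable (an assumption of this sketch).
    Note: str.lower() is not Turkish-locale aware (e.g. 'I' -> 'i').
    """
    padded = f"<{token.lower()}>"
    grams = []
    for n in range(n_min, n_max + 1):
        for i in range(len(padded) - n + 1):
            grams.append(padded[i:i + n])
    return grams


def token_features(tokens: List[str], i: int) -> Dict[str, float]:
    """Build a feature dictionary for the token at position i."""
    feats = {"bias": 1.0}
    # Character n-grams of the current token.
    for gram in char_ngrams(tokens[i]):
        feats[f"ng={gram}"] = 1.0
    # Hypothetical context features: neighbouring tokens or boundary flags.
    if i > 0:
        feats[f"prev={tokens[i - 1].lower()}"] = 1.0
    else:
        feats["BOS"] = 1.0
    if i < len(tokens) - 1:
        feats[f"next={tokens[i + 1].lower()}"] = 1.0
    else:
        feats["EOS"] = 1.0
    return feats


def sentence_features(tokens: List[str]) -> List[Dict[str, float]]:
    """Feature dictionaries for every token in a sentence."""
    return [token_features(tokens, i) for i in range(len(tokens))]


if __name__ == "__main__":
    # Invented mixed Turkish-English example sentence.
    sentence = ["bu", "song", "çok", "güzel"]
    for tok, feats in zip(sentence, sentence_features(sentence)):
        print(tok, sorted(feats)[:5])
```

A sequence labeler such as a CRF would then be trained on lists of these dictionaries paired with per-token language labels; the label set and training configuration are not given in the abstract and are left open here.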

Cite

Yirmibeşoğlu, Z., & Eryiğit, G. (2018). Detecting Code-Switching between Turkish-English Language Pair. In 4th Workshop on Noisy User-Generated Text, W-NUT 2018 - Proceedings of the Workshop (pp. 110–115). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w18-6115
