DCU-UVT: Word-Level Language Classification with Code-Mixed Data

Utsab Barman; Joachim Wagner; Grzegorz Chrupała; Jennifer Foster

Conference ProceedingsOPEN ACCESS

DCU-UVT: Word-Level Language Classification with Code-Mixed Data

1st Workshop on Computational Approaches to Code Switching, Switching 2014 at the 2014 Conference on Empirical Methods in Natural Language Processing, EMNLP 2014 - Proceedings (2014) 127-132

DOI: 10.3115/v1/w14-3915

28Citations

88Readers

Abstract

This paper describes the DCU-UVT team's participation in the Language Identification in Code-Switched Data shared task in the Workshop on Computational Approaches to Code Switching. Word-level classification experiments were carried out using a simple dictionary-based method, linear kernel support vector machines (SVMs) with and without contextual clues, and a k-nearest neighbour approach. Based on these experiments, we select our SVM-based system with contextual clues as our final system and present results for the Nepali-English and Spanish-English datasets.

Cite

CITATION STYLE

APA

Barman, U., Wagner, J., Chrupała, G., & Foster, J. (2014). DCU-UVT: Word-Level Language Classification with Code-Mixed Data. In 1st Workshop on Computational Approaches to Code Switching, Switching 2014 at the 2014 Conference on Empirical Methods in Natural Language Processing, EMNLP 2014 - Proceedings (pp. 127–132). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/w14-3915

DCU-UVT: Word-Level Language Classification with Code-Mixed Data

Abstract

Cite

Register to see more suggestions