DCU-UVT: Word-Level Language Classification with Code-Mixed Data

28Citations
Citations of this article
88Readers
Mendeley users who have this article in their library.

Abstract

This paper describes the DCU-UVT team's participation in the Language Identification in Code-Switched Data shared task in the Workshop on Computational Approaches to Code Switching. Word-level classification experiments were carried out using a simple dictionary-based method, linear kernel support vector machines (SVMs) with and without contextual clues, and a k-nearest neighbour approach. Based on these experiments, we select our SVM-based system with contextual clues as our final system and present results for the Nepali-English and Spanish-English datasets.

Cite

CITATION STYLE

APA

Barman, U., Wagner, J., Chrupała, G., & Foster, J. (2014). DCU-UVT: Word-Level Language Classification with Code-Mixed Data. In 1st Workshop on Computational Approaches to Code Switching, Switching 2014 at the 2014 Conference on Empirical Methods in Natural Language Processing, EMNLP 2014 - Proceedings (pp. 127–132). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/w14-3915

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free