Language Identification in Code-Mixed Data using Multichannel Neural Networks and Context Capture

13Citations
Citations of this article
97Readers
Mendeley users who have this article in their library.

Abstract

An accurate language identification tool is an absolute necessity for building complex NLP systems to be used on code-mixed data. Lot of work has been recently done on the same, but there’s still room for improvement. Inspired from the recent advancements in neural network architectures for computer vision tasks, we have implemented multichannel neural networks combining CNN and LSTM for word level language identification of code-mixed data. Combining this with a Bi-LSTM-CRF context capture module, accuracies of 93.28% and 93.32% is achieved on our two testing sets.

Cite

CITATION STYLE

APA

Mandal, S., & Singh, A. K. (2018). Language Identification in Code-Mixed Data using Multichannel Neural Networks and Context Capture. In 4th Workshop on Noisy User-Generated Text, W-NUT 2018 - Proceedings of the Workshop (pp. 116–120). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w18-6116

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free