Abstract
In this paper, we propose efficient and less resource-intensive strategies for parsing of code-mixed data. These strategies are not constrained by in-domain annotations, rather they leverage pre-existing monolingual annotated resources for training. We show that these methods can produce significantly better results as compared to an informed baseline. Besides, we also present a data set of 450 Hindi and English code-mixed tweets of Hindi multilingual speakers for evaluation. The data set is manually annotated with Universal Dependencies.
Cite
CITATION STYLE
Bhat, I. A., Bhat, R. A., Shrivastava, M., & Sharma, D. M. (2017). Joining Hands: Exploiting monolingual treebanks for parsing of code-mixing data. In 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017 - Proceedings of Conference (Vol. 2, pp. 324–330). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/e17-2052
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.