Creation of Corpus and analysis in Code-Mixed Kannada-English Twitter data for Emotion Prediction

12Citations
Citations of this article
67Readers
Mendeley users who have this article in their library.

Abstract

Emotion prediction is a critical task in the field of Natural Language Processing (NLP). There has been a significant amount of work done in emotion prediction for resource-rich languages. There has been work done on code-mixed social media corpus but not on emotion prediction of Kannada-English code-mixed Twitter data. In this paper, we analyze the problem of emotion prediction on corpus obtained from code-mixed Kannada-English extracted from Twitter annotated with their respective ‘Emotion’ for each tweet. We experimented with machine learning prediction models using features like Character N-Grams, Word N-Grams, Repetitive characters, and others on SVM and LSTM on our corpus, which resulted in an accuracy of 30% and 32%, respectively.

Cite

CITATION STYLE

APA

Reddy, A. A., Srirangam, V. K., Darsi, S., & Shrivastava, M. (2020). Creation of Corpus and analysis in Code-Mixed Kannada-English Twitter data for Emotion Prediction. In COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference (pp. 6703–6709). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.coling-main.587

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free