Sentiment analysis of Malayalam tweets using bidirectional encoder representations from transformers: a study

Abstract

Sentiment analysis of views and opinions expressed in Indian regional languages has become a current focus of research. However, compared with a globally used language such as English, research on sentiment analysis in Indian regional languages such as Malayalam remains scarce. One of the major hindrances is the lack of publicly available Malayalam datasets. This work focuses on building a Malayalam dataset to facilitate sentiment analysis of Malayalam texts and on studying how effectively a pre-trained deep learning model analyzes the sentiments latent in those texts. The dataset was created by extracting 2,000 tweets from Twitter. Bidirectional encoder representations from transformers (BERT) is a pre-trained model that has been used for various natural language processing tasks, and this work employs a transformer-based BERT model for Malayalam sentiment analysis. The efficacy of BERT is studied by comparing its performance with that of various machine learning and deep learning models. The results show that BERT achieves a substantial increase in accuracy of 5% over Bi-GRU, the next best-performing model.

Cite

Elankath, S. M., & Ramamirtham, S. (2023). Sentiment analysis of Malayalam tweets using bidirectional encoder representations from transformers: a study. Indonesian Journal of Electrical Engineering and Computer Science, 29(3), 1817–1826. https://doi.org/10.11591/ijeecs.v29.i3.pp1817-1826
