Sentiment Classification Using Multinomial Logistic Regression on Roman Urdu Text

Irfan Qutab; Khawar Iqbal Malik; Hira Arooj

Journal ArticleOPEN ACCESS

Sentiment Classification Using Multinomial Logistic Regression on Roman Urdu Text

Qutab I
Malik K
Arooj H

International Journal of Innovations in Science and Technology (2022) 4(2) 323-335

DOI: 10.33411/ijist/2022040204

N/ACitations

18Readers

Abstract

Sentiment analysis seeks to reveal textual knowledge of literary documents in which people communicate their thoughts and views on shared platforms, such as social blogs. On social blogs, users detail is available as short comments. A question of sentiment analysis has been raised by information across large dimensions published on these blogs. Although, some language libraries are established to address the problem of emotional analysis but limited work is available on Roman Urdu language because most of the comments or opinions available online are published in text-free style. The present study evaluates emotions in the comments of Roman Urdu by using a machine learning technique. This analysis was done in different stages of data collection, labeling, pre-processing, and feature extraction. In the final phase, we used the pipeline method along with Multinomial Logistic Regression for the classification of the dataset into four categories (Politics, Sports, Education and Religion). The whole dataset was divided into training and test sets. We evaluated our test set and achieved results by using Precision, Recall, Accuracy, F1 Score and Confusion Matrix and found the accuracy ranging to 94%.

Cite

CITATION STYLE

APA

Qutab, I., Malik, K. I., & Arooj, H. (2022). Sentiment Classification Using Multinomial Logistic Regression on Roman Urdu Text. International Journal of Innovations in Science and Technology, 4(2), 323–335. https://doi.org/10.33411/ijist/2022040204

Sentiment Classification Using Multinomial Logistic Regression on Roman Urdu Text

Abstract

Cite

Register to see more suggestions