Semantic analysis of Urdu English tweets empowered by machine learning

Nadia Tabassum; Tahir Alyas; Muhammad Hamid; Muhammad Saleem; Saadia Malik; Zain Ali; Umer Farooq

Journal Article (Open Access)

Intelligent Automation and Soft Computing (2021) 30(1) 175-186

DOI: 10.32604/iasc.2021.018998

15 Citations

28 Readers

Abstract

Opinion mining and sentiment analysis have developed rapidly; the field aims to explore opinions expressed in text on social media sites through machine-learning techniques for sentiment analysis, subjectivity analysis, and polarity calculation. Sentiment analysis is a natural language processing technique used to determine whether a piece of text is positive, negative, or neutral, and it is frequently applied to textual data to help organizations monitor brand and product sentiment in customer feedback and understand customer needs. In this paper, two strategies for sentiment analysis of Urdu and English tweets are proposed: word embedding and bag of words. Word embedding is a well-known set of techniques that capture word semantics based on the distributional hypothesis, which states that words occurring in the same contexts tend to have similar meanings. Bag of words is an approach used in natural language processing to extract information and features from written documents. For the bag of words, machine learning techniques such as naive Bayes, decision tree, k-nearest neighbor, and support vector machine are used to enhance accuracy. For word embedding, a neural network technique is proposed that combines a recurrent neural network (RNN) with long short-term memory (LSTM) for sentiment analysis of tweets. Datasets of Urdu and English tweets are used to classify tweets as negative or positive with machine learning techniques. The contribution of this paper is the implementation of a hybrid approach focused on a sentiment analyzer to overcome social network challenges, together with a comparative analysis of different machine learning algorithms. The results indicate an improvement: the combination of RNN with LSTM achieved an accuracy of 87% on the Urdu dataset and 92% on the English dataset.
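
The abstract does not give implementation details, so the following is only a minimal sketch of the bag-of-words route with one of the named classifiers (naive Bayes), written in plain Python. The toy tweets, the whitespace tokenizer, the add-one smoothing, and the names `bag_of_words`, `NaiveBayes`, and `model` are all assumptions made for illustration; they are not the authors' data, preprocessing, or code.

```python
import math
from collections import Counter, defaultdict


def bag_of_words(text):
    """Map a tweet to word counts; word order is discarded."""
    # Assumption: plain whitespace tokenization, no stemming or stop-word removal.
    return Counter(text.lower().split())


class NaiveBayes:
    """Multinomial naive Bayes over bag-of-words counts (illustrative only)."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)        # number of tweets per class
        self.word_counts = defaultdict(Counter)    # word counts per class
        for text, label in zip(texts, labels):
            self.word_counts[label].update(bag_of_words(text))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        scores = {}
        for label, n_docs in self.class_counts.items():
            total_words = sum(self.word_counts[label].values())
            score = math.log(n_docs / total_docs)  # log prior of the class
            for word, count in bag_of_words(text).items():
                if word not in self.vocab:
                    continue                       # ignore words never seen in training
                # Add-one (Laplace) smoothing keeps the probability above zero.
                p = (self.word_counts[label][word] + 1) / (total_words + len(self.vocab))
                score += count * math.log(p)
            scores[label] = score
        return max(scores, key=scores.get)


# Hypothetical toy data: three positive and three negative tweets.
train_texts = [
    "i love this phone",
    "what a great match",
    "یہ فلم بہت اچھی ہے",
    "i hate this phone",
    "what a terrible match",
    "یہ فلم بہت بری ہے",
]
train_labels = ["positive"] * 3 + ["negative"] * 3

model = NaiveBayes().fit(train_texts, train_labels)
print(model.predict("great phone"))
```

The same count vectors could be passed to the other classifiers the abstract lists (decision tree, k-nearest neighbor, support vector machine); only the model that consumes the counts changes. The word-embedding route with RNN and LSTM replaces the counts with learned dense vectors and is not shown here.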

Citation (APA)

Tabassum, N., Alyas, T., Hamid, M., Saleem, M., Malik, S., Ali, Z., & Farooq, U. (2021). Semantic analysis of Urdu English tweets empowered by machine learning. Intelligent Automation and Soft Computing, 30(1), 175–186. https://doi.org/10.32604/iasc.2021.018998
