Corpora Based Classification to Perform Sentiment Analysis in Kannada Language

  • R* S
  • et al.

Abstract

Users' opinions play an important role in showing how well a product meets customer requirements: producers can adapt the product to customer demand, and reviews help new consumers decide whether to purchase it. Sentiment analysis, a sub-domain of opinion mining, is the analysis of the feelings expressed toward a particular entity in terms of positive, negative, or neutral polarity. Here the analysis focuses on mining people's emotions and opinions about a specific topic. These emotions and opinions are collected as structured, semi-structured, or unstructured data. As the world slowly moves towards regional languages, this article discusses extracting opinions about a product in Kannada, analyzing the reviews, and classifying them accordingly. Because the language is not English, the available corpus is scarce. The limited corpus is collected from the website https://gadgetloka.com through an API. Extracting an overall opinion manually from a large volume of unstructured data would be tedious. An automated sentiment analysis (opinion mining) system solves this problem by analyzing the reviews and extracting the users' views from them. The classifier labels each review using a corpus, which is a large collection of pre-defined data. The reviews are parsed with the Python library Beautiful Soup, using UTF-8 encoding to handle Kannada characters. Each review is converted to text sentences, and each sentence is broken down into words. The sentiment of each word is determined by comparing it against two stored files, good.txt and bad.txt. The result is reported as text output (Positive, Negative, or Neutral) based on the word weights.
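The word-matching step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that good.txt and bad.txt hold one Kannada word per line in UTF-8, that every matched word carries a weight of 1, and that reviews can be tokenized on whitespace. The function names `load_lexicon` and `classify_review` and the sample words are hypothetical.

```python
from pathlib import Path


def load_lexicon(path, encoding="utf-8"):
    """Read a lexicon file into a set (assumed format: one word per line)."""
    text = Path(path).read_text(encoding=encoding)
    return {line.strip() for line in text.splitlines() if line.strip()}


def classify_review(review, good_words, bad_words):
    """Label a review by counting lexicon hits among its tokens.

    Assumption: each matching word has weight 1; the paper's actual
    weighting scheme is not given in the abstract.
    """
    # Split on whitespace, then trim ASCII punctuation from token edges.
    # Kannada letters and vowel signs are not in the strip set, so they stay intact.
    tokens = [tok.strip(".,!?;:\"'()") for tok in review.split()]
    positive_hits = sum(1 for tok in tokens if tok in good_words)
    negative_hits = sum(1 for tok in tokens if tok in bad_words)
    score = positive_hits - negative_hits
    if score > 0:
        return "Positive"
    if score < 0:
        return "Negative"
    return "Neutral"


# Illustrative in-memory lexicons; in practice these would come from
# load_lexicon("good.txt") and load_lexicon("bad.txt").
GOOD_WORDS = {"ಉತ್ತಮ", "ಒಳ್ಳೆಯ"}
BAD_WORDS = {"ಕೆಟ್ಟ", "ಕಳಪೆ"}

if __name__ == "__main__":
    print(classify_review("ಈ ಫೋನ್ ಉತ್ತಮ", GOOD_WORDS, BAD_WORDS))
```

Exact string matching like this misses inflected word forms, which are common in Kannada, so a practical system would likely need stemming or a larger lexicon; that extension is outside what the abstract describes.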

Cite

R*, S., & Swamy, S. (2020). Corpora Based Classification to Perform Sentiment Analysis in Kannada Language. International Journal of Recent Technology and Engineering (IJRTE), 8(5), 3186–3191. https://doi.org/10.35940/ijrte.e6872.018520
