BCSAT : A benchmark corpus for sentiment analysis in telugu using word-level annotations

Sreekavitha Parupalli; Vijjini Anvesh Rao; Radhika Mamidi

Conference ProceedingsOPEN ACCESS

BCSAT : A benchmark corpus for sentiment analysis in telugu using word-level annotations

ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Student Research Workshop (2018) 99-104

DOI: 10.18653/v1/p18-3014

4Citations

92Readers

Abstract

The presented work aims at generating a systematically annotated corpus that can support the enhancement of senti- ment analysis tasks in Telugu using word- level sentiment annotations. From On- toSenseNet, we extracted 11,000 adjec- tives, 253 adverbs, 8483 verbs and sen- timent annotation is being done by lan- guage experts. We discuss the methodol- ogy followed for the polarity annotations and validate the developed resource. This work aims at developing a benchmark cor- pus, as an extension to SentiWordNet, and baseline accuracy for a model where lex- eme annotations are applied for sentiment predictions. The fundamental aim of this paper is to validate and study the possi- bility of utilizing machine learning algo- rithms, word-level sentiment annotations in the task of automated sentiment identifi- cation. Furthermore, accuracy is improved by annotating the bi-grams extracted from the target corpus.

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Parupalli, S., Rao, V. A., & Mamidi, R. (2018). BCSAT : A benchmark corpus for sentiment analysis in telugu using word-level annotations. In ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Student Research Workshop (pp. 99–104). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/p18-3014

Readers over time

Readers' Seniority

PhD / Post grad / Masters / Doc 26

63%

Researcher 9

22%

Lecturer / Post doc 5

12%

Professor / Associate Prof. 1

Readers' Discipline

Computer Science 35

76%

Linguistics 7

15%

Social Sciences 2

Engineering 2

BCSAT : A benchmark corpus for sentiment analysis in telugu using word-level annotations

Abstract

References Powered by Scopus

Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit

Recognizing contextual polarity in phrase-level sentiment analysis

Measuring praise and criticism: Inference of semantic orientation from association

Cited by Powered by Scopus

Am I a Resource-Poor Language? Data Sets, Embeddings, Models and Analysis for four different NLP Tasks in Telugu Language

A Benchmark of Modeling for Sentiment Analysis of the Indonesian Presidential Election in 2019

Corpus Creation in Telugu: Sentiment Classification Using Ensemble Approaches

Register to see more suggestions

Cite

Readers over time

Readers' Seniority

Readers' Discipline