Annotation of a Corpus of Tweets for Sentiment Analysis

Allisfrank dos Santos; Jorge Daniel Barros Júnior; Heloisa de Arruda Camargo

Conference Proceedings

Annotation of a Corpus of Tweets for Sentiment Analysis

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 11122 LNAI 294-302

DOI: 10.1007/978-3-319-99722-3_30

2Citations

4Readers

Get full text

Abstract

This article describes the process of creation and annotation of a tweets corpus for Sentiment Analysis at sentence level. The tweets were captured using the #masterchefbr hashtag, in a tool to acquire the public stream of tweets in real time and then annotated based on the six basic emotions (joy, surprise, fear, sadness, disgust, anger) commonly used in the literature. The neutral tag was adopted to annotate sentences where there was no expressed emotion. At the end of the process, the measure of disagreement between annotators reached a Kappa value of 0.42. Some experiments with the SVM algorithm (Support Vector Machine) have been performed with the objective of submitting the annotated corpus to a classification process, to better understand the Kappa value of the corpus. An accuracy of 52.9% has been obtained in the classification process when using both discordant and concordant text within the corpus.

Author supplied keywords

Cite

CITATION STYLE

APA

dos Santos, A., Júnior, J. D. B., & de Arruda Camargo, H. (2018). Annotation of a Corpus of Tweets for Sentiment Analysis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11122 LNAI, pp. 294–302). Springer Verlag. https://doi.org/10.1007/978-3-319-99722-3_30

Annotation of a Corpus of Tweets for Sentiment Analysis

Abstract

Author supplied keywords

Cite

Register to see more suggestions