Influence of weak labels for emotion recognition of tweets

Abstract

Research on emotion recognition of tweets focuses on feature engineering and algorithm design, while the quality of dataset labels is rarely questioned. Tweet datasets are typically labelled manually or via crowdsourcing, which yields strong labels but is time-intensive and can be expensive. Alternatively, tweet hashtags can be used as free, inexpensive weak labels. This paper investigates the impact of using weak labels instead of strong labels. The study uses two label sets for the same corpus of tweets: a weak label set derived from the tweets' hashtags and a strong label set obtained through crowdsourcing. Both label sets are used separately as input for five classification algorithms to measure the classification performance achievable with weak labels. The results show only a 9.25% decrease in F1-score when using weak labels, a performance drop that is outweighed by the benefit of obtaining the labels for free.
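The sketch below illustrates the general idea of hashtag-based weak labelling described in the abstract; it is not the authors' pipeline. The hashtag-to-emotion map, the toy tweets, the stand-in "strong" labels, and the choice of a TF-IDF plus logistic regression classifier are all assumptions made for illustration only; the paper itself evaluates five classifiers on a crowdsourced corpus.

```python
# Minimal sketch (assumed setup, not the paper's pipeline): derive weak emotion
# labels from hashtags, train a classifier on them, and score it against a
# strong (human-annotated) label set. All data below is illustrative.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score
from sklearn.model_selection import train_test_split

# Hypothetical hashtag-to-emotion mapping used for weak labelling.
HASHTAG_EMOTIONS = {"#happy": "joy", "#excited": "joy",
                    "#sad": "sadness", "#angry": "anger"}

tweets = [
    "What a great day #happy",
    "Lost my keys again #angry",
    "Missing my friends so much #sad",
    "Can't wait for the weekend #excited",
] * 10  # repeated only so the toy train/test split has enough samples


def weak_label(tweet):
    """Assign an emotion from the first known hashtag, else None."""
    for token in tweet.split():
        if token.lower() in HASHTAG_EMOTIONS:
            return HASHTAG_EMOTIONS[token.lower()]
    return None


def strip_hashtags(tweet):
    """Remove label-bearing hashtags so the classifier cannot read the answer."""
    return " ".join(t for t in tweet.split() if t.lower() not in HASHTAG_EMOTIONS)


# Weak labels come for free from the hashtags; strong labels would come from
# crowdsourcing. Here the weak labels are reused as a stand-in for the strong
# ones, purely to keep the sketch self-contained.
texts = [strip_hashtags(t) for t in tweets]
weak = [weak_label(t) for t in tweets]
strong = list(weak)  # placeholder for crowdsourced annotations

X_train, X_test, yw_train, _, _, ys_test = train_test_split(
    texts, weak, strong, test_size=0.25, random_state=0)

vectorizer = TfidfVectorizer()
clf = LogisticRegression(max_iter=1000)
clf.fit(vectorizer.fit_transform(X_train), yw_train)
pred = clf.predict(vectorizer.transform(X_test))

# Evaluate against the strong labels, mirroring the paper's comparison of
# classification performance under the two label sets.
print("macro F1 vs strong labels:", f1_score(ys_test, pred, average="macro"))
```

On real data, the same comparison would be run twice, once training on the weak labels and once on the strong labels, with both models evaluated against the strong test labels; the gap between the two F1-scores corresponds to the roughly 9.25% decrease reported in the abstract.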

Citation (APA)

Janssens, O., Verstockt, S., Mannens, E., Van Hoecke, S., & Van De Walle, R. (2014). Influence of weak labels for emotion recognition of tweets. In Lecture Notes in Computer Science (Vol. 8891, pp. 108–118). Springer. https://doi.org/10.1007/978-3-319-13817-6_12
