DenS: A dataset for multi-class emotion analysis

Chen Liu; Muhammad Osama; Anderson de Andrade

Conference Proceedings

DenS: A dataset for multi-class emotion analysis

EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference (2019) 6293-6298

DOI: 10.18653/v1/D19-1656

20Citations

115Readers

Get full text

Abstract

We introduce a new dataset for multi-class emotion analysis from long-form narratives in English. The Dataset for Emotions of Narrative Sequences (DENS) was collected from both classic literature available on Project Gutenberg and modern online narratives available on Wattpad, annotated using Amazon Mechanical Turk. A number of statistics and baseline benchmarks are provided for the dataset. Of the tested techniques, we find that the fine-tuning of a pre-trained BERT model achieves the best results, with an average micro-F1 score of 60.4%. Our results show that the dataset provides a novel opportunity in emotion analysis that requires moving beyond existing sentence-level techniques.

Cite

CITATION STYLE

APA

Liu, C., Osama, M., & de Andrade, A. (2019). DenS: A dataset for multi-class emotion analysis. In EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference (pp. 6293–6298). Association for Computational Linguistics. https://doi.org/10.18653/v1/D19-1656

DenS: A dataset for multi-class emotion analysis

Abstract

Cite

Register to see more suggestions