Social media contains a vast amount of information about people's emotions and their causes. Emotion cause extraction (ECE) from social media data is an important research area that has not been thoroughly explored due to the lack of fine-grained annotations. Early studies relied on either unsupervised rule-based methods or supervised machine learning methods trained on manually annotated data from specific domains. However, the former are limited in extraction performance, while the latter are constrained by the availability of fine-grained annotations and struggle to generalize to diverse domains. To address these issues, this paper proposes a new ECE framework for Chinese social media that achieves high extraction performance and generalizability without relying on human annotation. Specifically, we design a dedicated rule-based system based on constituency parse trees to discover causal patterns in social media. This system enables us to acquire large amounts of fine-grained annotated data. Next, we train a neural model on the rule-annotated dataset with a specific training strategy to further improve the model's generalizability. Extensive experiments demonstrate the superiority of our approach over other methods in unsupervised and weakly-supervised settings.
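The idea of producing annotations with rules rather than human labelers can be illustrated with a toy sketch. This is not the authors' system: it swaps their constituency-parse rules on Chinese text for plain cue-word matching on English clauses, and every name in it (`CAUSAL_CUES`, `EMOTION_WORDS`, `split_clauses`, `weak_label`) is a hypothetical chosen for illustration.

```python
import re
from typing import List, Optional, Tuple

# Hypothetical cue lists for illustration only. The paper's actual rules
# operate on constituency parse trees of Chinese text and are not reproduced here.
CAUSAL_CUES = ("because", "since", "due to")
EMOTION_WORDS = ("happy", "sad", "angry", "excited")


def split_clauses(text: str) -> List[str]:
    """Split a post into clauses on commas, semicolons, and sentence-final punctuation."""
    parts = re.split(r"[,;.!?]+", text)
    return [p.strip() for p in parts if p.strip()]


def weak_label(text: str) -> Optional[Tuple[str, str]]:
    """Return (emotion_clause, cause_span) if a causal cue fires, else None."""
    clauses = split_clauses(text)
    lowered = [c.lower() for c in clauses]

    # Find the first clause that contains an emotion word.
    emotion_idx = next(
        (
            i
            for i, c in enumerate(lowered)
            if any(re.search(rf"\b{re.escape(w)}\b", c) for w in EMOTION_WORDS)
        ),
        None,
    )
    if emotion_idx is None:
        return None

    # Take the text following the first causal cue as the cause span.
    for i, c in enumerate(lowered):
        for cue in CAUSAL_CUES:
            match = re.search(rf"\b{re.escape(cue)}\b", c)
            if match:
                cause = clauses[i][match.end():].strip()
                if cause:
                    return clauses[emotion_idx], cause
    return None


if __name__ == "__main__":
    print(weak_label("I am so happy, because I passed the exam."))
    print(weak_label("Nice weather today."))
```

Pairs produced this way could serve as weak labels for training a model, which is the general shape of the pipeline the abstract describes. The actual rules and training strategy are those given in the paper.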
Xiao, D., Xia, R., & Yu, J. (2023). Emotion Cause Extraction on Social Media without Human Annotation. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 1455–1468). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-acl.94