Abstract
This paper summarizes our contribution to the Hyperpartisan News Detection task in SemEval 2019. We experiment with two different approaches: 1) an SVM classifier based on word vector averages and hand-crafted linguistic features, and 2) a BiLSTM-based neural text classifier trained on a filtered training set. Surprisingly, despite their different nature, both approaches achieve an accuracy of 0.74. The main focus of this paper is to further analyze the remarkable fact that a simple feature-based approach can perform on par with modern neural classifiers. We also highlight the effectiveness of our filtering strategy for training the neural network on a large but noisy training set.
Cite
CITATION STYLE
Pérez-Almendros, C., Espinosa-Anke, L., & Schockaert, S. (2019). Cardiff University at SemEval-2019 task 4: Linguistic features for hyperpartisan news detection. In NAACL HLT 2019 - International Workshop on Semantic Evaluation, SemEval 2019, Proceedings of the 13th Workshop (pp. 929–933). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/s19-2158
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.