Cardiff University at SemEval-2019 task 4: Linguistic features for hyperpartisan news detection

Carla Pérez-Almendros; Luis Espinosa-Anke; Steven Schockaert

Conference Proceedings

Cardiff University at SemEval-2019 task 4: Linguistic features for hyperpartisan news detection

NAACL HLT 2019 - International Workshop on Semantic Evaluation, SemEval 2019, Proceedings of the 13th Workshop (2019) 929-933

DOI: 10.18653/v1/s19-2158

3Citations

69Readers

Get full text

Abstract

This paper summarizes our contribution to the Hyperpartisan News Detection task in SemEval 2019. We experiment with two different approaches: 1) an SVM classifier based on word vector averages and hand-crafted linguistic features, and 2) a BiLSTM-based neural text classifier trained on a filtered training set. Surprisingly, despite their different nature, both approaches achieve an accuracy of 0.74. The main focus of this paper is to further analyze the remarkable fact that a simple feature-based approach can perform on par with modern neural classifiers. We also highlight the effectiveness of our filtering strategy for training the neural network on a large but noisy training set.

Cite

CITATION STYLE

APA

Pérez-Almendros, C., Espinosa-Anke, L., & Schockaert, S. (2019). Cardiff University at SemEval-2019 task 4: Linguistic features for hyperpartisan news detection. In NAACL HLT 2019 - International Workshop on Semantic Evaluation, SemEval 2019, Proceedings of the 13th Workshop (pp. 929–933). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/s19-2158

Cardiff University at SemEval-2019 task 4: Linguistic features for hyperpartisan news detection

Abstract

Cite

Register to see more suggestions