We use the 600 hand-labelled articles from SemEval Task 4 (Kiesel et al., 2019) to hand-tune a classifier with 3000 features for the Hyperpartisan News Detection task. Our final system uses features based on bag-of-words (BoW), analysis of the article title, language complexity, and simple sentiment analysis in a naive Bayes classifier. We trained our final system on the 600,000 articles labelled by publisher. Our final system has an accuracy of 0.653 on the hand-labeled test set. The most effective features are the Automated Readability Index and the presence of certain words in the title. This suggests that hyperpartisan writing uses a distinct writing style, especially in the title.
CITATION STYLE
Amason, E., Palanker, J., Shen, M. C., & Medero, J. (2019). Harvey Mudd College at SemEval-2019 task 4: The D.X. Beaumont hyperpartisan news detector. In NAACL HLT 2019 - International Workshop on Semantic Evaluation, SemEval 2019, Proceedings of the 13th Workshop (pp. 967–970). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/s19-2166
Mendeley helps you to discover research relevant for your work.