Harvey Mudd College at SemEval-2019 task 4: The D.X. Beaumont hyperpartisan news detector

2Citations
Citations of this article
59Readers
Mendeley users who have this article in their library.

Abstract

We use the 600 hand-labelled articles from SemEval Task 4 (Kiesel et al., 2019) to hand-tune a classifier with 3000 features for the Hyperpartisan News Detection task. Our final system uses features based on bag-of-words (BoW), analysis of the article title, language complexity, and simple sentiment analysis in a naive Bayes classifier. We trained our final system on the 600,000 articles labelled by publisher. Our final system has an accuracy of 0.653 on the hand-labeled test set. The most effective features are the Automated Readability Index and the presence of certain words in the title. This suggests that hyperpartisan writing uses a distinct writing style, especially in the title.

Cite

CITATION STYLE

APA

Amason, E., Palanker, J., Shen, M. C., & Medero, J. (2019). Harvey Mudd College at SemEval-2019 task 4: The D.X. Beaumont hyperpartisan news detector. In NAACL HLT 2019 - International Workshop on Semantic Evaluation, SemEval 2019, Proceedings of the 13th Workshop (pp. 967–970). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/s19-2166

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free