Aim/Purpose The aim of this paper is to propose an ensemble learners based classification model for classification clickbaits from genuine article headlines. Background Clickbaits are online articles with deliberately designed misleading titles for luring more and more readers to open the intended web page. Clickbaits are used to tempted visitors to click on a particular link either to monetize the landing page or to spread the false news for sensationalization. The presence of clickbaits on any news aggregator portal may lead to an unpleasant experience for readers. Therefore, it is essential to distinguish clickbaits from authentic headlines to mit-igate their impact on readers' perception. Methodology A total of one hundred thousand article headlines are collected from news ag-gregator sites consists of clickbaits and authentic news headlines. The collected data samples are divided into five training sets of balanced and unbalanced data. The natural language processing techniques are used to extract 19 manually se-lected features from article headlines. Contribution Three ensemble learning techniques including bagging, boosting, and random forests are used to design a classifier model for classifying a given headline into the clickbait or non-clickbait. The performances of learners are evaluated using accuracy, precision, recall, and F-measures. Findings It is observed that the random forest classifier detects clickbaits better than the other classifiers with an accuracy of 91.16 %, a total precision, recall, and f-measure of 91 %.
CITATION STYLE
Sisodia, D. S. (2019). Ensemble learning approach for clickbait detection using article headline features. Informing Science, 22(2019), 31–44. https://doi.org/10.28945/4279
Mendeley helps you to discover research relevant for your work.