Hybrid model: IndoBERT and long short-term memory for detecting Indonesian hoax news

0Citations
Citations of this article
31Readers
Mendeley users who have this article in their library.

Abstract

The world has entered an era that technology has developed far. Due to rapid technological development, information is easily spread. However, not all information spread through social media is factual information. Responding to this social phenomenon, we initiated to create a hoax detection system using the combined method of Indo bidirectional encoder representations from transformers (IndoBERT) and long short-term memory (LSTM). The dataset used in this study are obtained through the process scraping on the site turnbackhoax.id and cable news network (CNN) Indonesia. We decided to use the IndoBERT-LSTM method to detect hoaxes, using IndoBERT as the feature extractor and LSTM as the classification layer can be an effective method because of its advantages in managing and understanding Indonesian language. The results show that the IndoBERT-LSTM model achieved an accuracy of 93.2%, precision of 92%, recall of 89.7%, and F1-score of 90,8%. From a total of 5876 data composed of a total of 1998 factual news and 3878 hoax data. The hoax detection system using IndoBERT-LSTM is a promising approach for detecting hoaxes accurately and efficiently. This model has the potential to make a significant impact in the fight against the spread of Hoaxes.

Cite

CITATION STYLE

APA

Yefferson, D. Y., Lawijaya, V., & Girsang, A. S. (2024). Hybrid model: IndoBERT and long short-term memory for detecting Indonesian hoax news. IAES International Journal of Artificial Intelligence, 13(2), 1911–1922. https://doi.org/10.11591/ijai.v13.i2.pp1913-1924

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free