Long short-term memory for hate speech and abusive language detection on Indonesian YouTube comment section

3Citations
Citations of this article
22Readers
Mendeley users who have this article in their library.

Abstract

Hate speech is one of the most challenging problem internet is facing today. With increasing numbers of users online, hate speech also rise and takes time to be classified manually particularly in languages other than English. This research examines hate speech detection problem in form of Bahasa Indonesia. Millions of comments and text posts are added to various social media and discussion platforms. Manual classification in all of the internet as hate speech and offensive language is a near impossible and time-consuming task. This research uses Long Short-Term Memory (LSTM) and Bidirectional Long Short Term Memory (Bi-LSTM) for the method of classifying hate speech and abusive language. The final accuracy is 88,44% by using 200 neurons with Bi-LSTM method. Most common challenges are different languages, out of vocabulary words, long range dependencies, and sarcasm.

Cite

CITATION STYLE

APA

Salim, C. E. R., & Suhartono, D. (2021). Long short-term memory for hate speech and abusive language detection on Indonesian YouTube comment section. In 2021 11th International Workshop on Computer Science and Engineering, WCSE 2021 (pp. 193–200). International Workshop on Computer Science and Engineering (WCSE). https://doi.org/10.18178/wcse.2021.06.029

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free