FastText Word Embedding and Random Forest Classifier for User Feedback Sentiment Classification in Bahasa Indonesia

  • Gunawan Y
  • Young J
  • Rusli A

Abstract

User feedback has become a channel through which software developers identify and understand user requirements, preferences, and complaints. It is important for developers to identify the problems reported in user feedback. As software grows, so does its number of users, and reading and classifying feedback manually, one item at a time, wastes time and energy. To address this problem, a sentiment analysis system is built that uses a Random Forest Classifier, with word embedding as the feature extraction method, to classify feedback as positive, neutral, or negative. The Random Forest algorithm is chosen because it gives the best performance, even though it needs more resources. Furthermore, word embedding makes it possible to detect words that have semantic or syntactic similarities. Word embedding does not need stemming or stop word removal, so the context of the sentences is preserved. This research implements word embedding to classify the sentiment of user feedback using a Random Forest Classifier. The system reaches 70.27% accuracy, 80% precision, 54% recall, and a 54% F1 score when the BYU dataset (200 dimensions) is used as the embedding dataset with a train-to-test ratio of 80:20.
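
The abstract does not say how word vectors are combined into one feature vector per feedback item. A common choice, assumed here purely for illustration, is to average the vectors of the words in the text, which gives the classifier a fixed-length input. The minimal Python sketch below shows that step with a hypothetical three-dimensional lookup table (`TOY_VECTORS`) and a hypothetical helper (`sentence_vector`); neither comes from the paper, and a real setup would load pretrained 200-dimensional FastText vectors instead.

```python
from typing import Dict, List

# Hypothetical 3-dimensional vectors, for illustration only. Real FastText
# vectors would be loaded from a pretrained model (e.g. 200 dimensions).
TOY_VECTORS: Dict[str, List[float]] = {
    "aplikasi": [0.2, 0.1, 0.0],
    "bagus": [0.8, 0.3, 0.1],
    "lambat": [-0.6, 0.2, 0.4],
}


def sentence_vector(text: str, vectors: Dict[str, List[float]], dim: int) -> List[float]:
    """Average the vectors of the known tokens in `text`.

    Assumption: whitespace tokenization and lowercasing, with no stemming
    or stop word removal. Unknown tokens are skipped; if no token is known,
    a zero vector is returned.
    """
    tokens = text.lower().split()
    known = [vectors[t] for t in tokens if t in vectors]
    if not known:
        return [0.0] * dim
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


# One fixed-length feature vector per feedback item, ready for a classifier.
features = sentence_vector("Aplikasi bagus", TOY_VECTORS, dim=3)
unknown = sentence_vector("xyz", TOY_VECTORS, dim=3)
```

Skipping unknown tokens is a simplification of this toy lookup table: an actual FastText model builds vectors for out-of-vocabulary words from character n-grams, so it rarely needs the zero-vector fallback.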

Cite

Gunawan, Y., Young, J. C., & Rusli, A. (2022). FastText Word Embedding and Random Forest Classifier for User Feedback Sentiment Classification in Bahasa Indonesia. Ultimatics : Jurnal Teknik Informatika, 13(2), 101–107. https://doi.org/10.31937/ti.v13i2.2124
