HyVADRF: Hybrid VADER-Random Forest and GWO for Bitcoin Tweet Sentiment Analysis

33Citations
Citations of this article
78Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

In recent years, Bitcoin and other cryptocurrencies have been increasingly considered investment options for emerging markets. However, Bitcoin's erratic behavior has discouraged some potential investors. To get insights into its behavior and price fluctuation, past studies have discovered the correlation between Twitter sentiments and Bitcoin behavior. Most of them have exclusively focused on their relationships, instead of the Twitter sentiment analysis itself. Finding the most suitable classification algorithms for sentiment analysis for this kind of data is challenging. For the enormous data in Twitter, the supervised sentiment analysis approach of unlabeled data can be time-consuming and expensive, which has been studied to be superior to unsupervised ones. As such, we propose the HyVADRF (hybrid valence aware dictionary and sentiment reasoner (VADER)-random forest) and gray wolf optimizer (GWO) model. A semantic and rule-based VADER was used to calculate polarity scores and classify sentiments, which overcame the weakness of manual labeling, while a random forest was utilized as its supervised classifier. Furthermore, considering Twitter's massive size, we collected over 3.6 million tweets and analyzed various dataset sizes as these are related to the model's learning process. Lastly, GWO parameter tuning was conducted to optimize the classifier's performance. The results show that 1) the HyVADRF model had an accuracy of 75.29%, precision of 70.22%, recall of 87.70%, and F1-score of 78%. 2) The most ideal dataset size percentage is 90% of the total collected tweets ( n =1 ,249,060). 3) The standard deviations are 0.0008 for accuracy and F1-score and 0.0011 for precision and recall. Hence, the HyVADRF model consistently delivers stable results.

Cite

CITATION STYLE

APA

Mardjo, A., & Choksuchat, C. (2022). HyVADRF: Hybrid VADER-Random Forest and GWO for Bitcoin Tweet Sentiment Analysis. IEEE Access, 10, 101889–101897. https://doi.org/10.1109/ACCESS.2022.3209662

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free