Pre-processing framework for twitter sentiment classification

Elias Dritsas; Gerasimos Vonitsanos; Ioannis E. Livieris; Andreas Kanavos; Aristidis Ilias; Christos Makris; Athanasios Tsakalidis

Conference ProceedingsOPEN ACCESS

Pre-processing framework for twitter sentiment classification

IFIP Advances in Information and Communication Technology (2019) 560 138-149

DOI: 10.1007/978-3-030-19909-8_12

11Citations

14Readers

Abstract

Twitter Sentiment Classification is undergoing great appeal from the research community; also, user posts and opinions are producing very interesting conclusions and information. In the context of this paper, a pre-processing tool was developed in Python language. This tool processes text and natural language data intending to remove wrong values and noise. The main reason for developing such a tool is to achieve sentiment analysis in an optimum and efficient way. The most remarkable characteristic is considered the use of emojis and emoticons in the sentiment analysis field. Moreover, supervised machine learning techniques were utilized for the analysis of users' posts. Through our experiments, the performance of the involved classifiers, namely Naive Bayes and SVM, under specific parameters such as the size of the training data, the employed methods for feature selection (unigrams, bigrams and trigrams) are evaluated. Finally, the performance was assessed based on independent datasets through the application of k-fold cross validation.

Author supplied keywords

Cite

CITATION STYLE

APA

Dritsas, E., Vonitsanos, G., Livieris, I. E., Kanavos, A., Ilias, A., Makris, C., & Tsakalidis, A. (2019). Pre-processing framework for twitter sentiment classification. In IFIP Advances in Information and Communication Technology (Vol. 560, pp. 138–149). Springer New York LLC. https://doi.org/10.1007/978-3-030-19909-8_12

Pre-processing framework for twitter sentiment classification

Abstract

Author supplied keywords

Cite

Register to see more suggestions