Using stylometric features for sentiment classification

Rafael T. Anchiêta; Francisco Assis Ricarte Neto; Rogério Figueiredo De Sousa; Raimundo Santos Moura

Conference Proceedings

Using stylometric features for sentiment classification

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9042 189-200

DOI: 10.1007/978-3-319-18117-2_15

6Citations

15Readers

Get full text

Abstract

This paper is a comparative study about text feature extractionmethods in statistical learning of sentiment classification. Featureextraction is one of the most important steps in classification systems.We use stylometry to compare with TF-IDF and Delta TF-IDF baselinemethods in sentiment classification. Stylometry is a research areaof Linguistics that uses statistical techniques to analyze literary style.In order to assess the viability of the stylometry, we create a corpus ofproduct reviews from the most traditional online service in Portuguese,namely, Buscapé. We gathered 2000 review about Smartphones. We usethree classifiers, Support Vector Machine (SVM), Naive Bayes, and J48to evaluate whether the stylometry has higher accuracy than the TFIDFand Delta TF-IDF methods in sentiment classification. We foundthe better result with the SVM classifier (82,75%) of accuracy with stylometryand (72,62%) with Delta TF-IDF and (56,25%) with TF-IDF.The results show that stylometry is quite feasible method for sentimentclassification, outperforming the accuracy of the baseline methods.We may emphasize that approach used has promising results.

Cite

CITATION STYLE

APA

Anchiêta, R. T., Neto, F. A. R., De Sousa, R. F., & Moura, R. S. (2015). Using stylometric features for sentiment classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9042, pp. 189–200). Springer Verlag. https://doi.org/10.1007/978-3-319-18117-2_15

Using stylometric features for sentiment classification

Abstract

Cite

Register to see more suggestions