In this paper, we compare lexicon-based and machine learning-based approaches to classifying the subjectivity of tweets in Portuguese. We tested the SentiLex and WordAffectBR lexicons, and the Sequential Minimal Optimization and Naive Bayes algorithms for this task. In our study, we used the Computer-BR corpus, which contains messages about the technology area. We obtained the best results using the Comprehensive Measurement Feature Selection method with the Sequential Minimal Optimization algorithm as the classifier. We achieved considerable accuracy when we included the polarities of words in the vector space model of tweets.
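The two ideas mentioned in the abstract, a lexicon-based subjectivity decision and a vector space model extended with word polarities, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy lexicon entries, the whitespace tokenizer, the rule "a tweet is subjective if it contains at least one polar word", and the function names are hypothetical and are not taken from the paper, from SentiLex, or from WordAffectBR.

```python
from collections import Counter

# Hypothetical toy lexicon: word -> polarity (+1 positive, -1 negative).
# Illustrative entries only; NOT taken from SentiLex or WordAffectBR.
TOY_LEXICON = {"ótimo": 1, "adoro": 1, "péssimo": -1, "lento": -1}


def tokenize(tweet):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return tweet.lower().split()


def is_subjective(tweet, lexicon=TOY_LEXICON):
    """Assumed lexicon rule: subjective if any token carries a polarity."""
    return any(token in lexicon for token in tokenize(tweet))


def polarity_features(tweet, vocabulary, lexicon=TOY_LEXICON):
    """Bag-of-words counts over `vocabulary`, followed by two extra
    dimensions: the number of positive and of negative lexicon words."""
    tokens = tokenize(tweet)
    counts = Counter(tokens)
    bag_of_words = [counts[word] for word in vocabulary]
    positive = sum(1 for t in tokens if lexicon.get(t, 0) > 0)
    negative = sum(1 for t in tokens if lexicon.get(t, 0) < 0)
    return bag_of_words + [positive, negative]


# Hypothetical example data in the technology domain.
vocabulary = ["notebook", "lento", "ótimo", "novo"]
subjective_tweet = "Meu notebook novo está lento"
objective_tweet = "Comprei um notebook novo"

subjective_vector = polarity_features(subjective_tweet, vocabulary)
objective_vector = polarity_features(objective_tweet, vocabulary)
```

Vectors of this kind could then be passed to any supervised classifier. The sketch stops before that step because the abstract does not describe the exact feature layout or training setup.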
CITATION STYLE
Moraes, S. M. W., Santos, A. L. L., Redecker, M., Machado, R. M., & Meneguzzi, F. R. (2016). Comparing approaches to subjectivity classification: A study on Portuguese tweets. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9727, pp. 86–94). Springer Verlag. https://doi.org/10.1007/978-3-319-41552-9_8