Evaluating Active Learning Sampling Strategies for Opinion Mining in Brazilian Politics Corpora

Douglas Vitório; Ellen Souza; Adriano L.I. Oliveira

Conference Proceedings

Evaluating Active Learning Sampling Strategies for Opinion Mining in Brazilian Politics Corpora

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11805 LNAI 695-707

DOI: 10.1007/978-3-030-30244-3_57

4Citations

10Readers

Get full text

Abstract

Politics is a commonly used domain in Opinion Mining applications, in which opinions may change over time. Nevertheless, the usual approaches for Opinion Mining are not able to deal with the characteristics and the challenges brought by continuous data streams; so, an alternative is the use of techniques such as Active Learning, which labels selected data rather than the entire data set. The Active Learning approach requires the choice of a sampling strategy to select the most valuable instances. However, no study has performed an analysis in order to identify the best strategies for Opinion Mining. In this sense, we evaluated eight Active Learning sampling strategies, from which Entropy achieved the best results. In addition, due to the lack of publicly available stream data sets written in Portuguese, we created and evaluated corpora from Twitter and Facebook about the 2018 Brazilian presidential elections.

Author supplied keywords

Cite

CITATION STYLE

APA

Vitório, D., Souza, E., & Oliveira, A. L. I. (2019). Evaluating Active Learning Sampling Strategies for Opinion Mining in Brazilian Politics Corpora. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11805 LNAI, pp. 695–707). Springer Verlag. https://doi.org/10.1007/978-3-030-30244-3_57

Evaluating Active Learning Sampling Strategies for Opinion Mining in Brazilian Politics Corpora

Abstract

Author supplied keywords

Cite

Register to see more suggestions