Evaluating the Accuracy and Efficiency of Sentiment Analysis Pipelines with UIMA

Nabeela Altrabsheh; Georgios Kontonatsios; Yannis Korkontzelos

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11608 LNCS 286-294

DOI: 10.1007/978-3-030-23281-8_23

Abstract

Sentiment analysis methods co-ordinate text mining components, such as sentence splitters, tokenisers and classifiers, into pipelined applications to automatically analyse the emotions or sentiment expressed in textual content. However, the performance of sentiment analysis pipelines is known to be substantially affected by the constituent components. In this paper, we leverage the Unstructured Information Management Architecture (UIMA) to seamlessly co-ordinate components into sentiment analysis pipelines. We then evaluate a wide range of different combinations of text mining components to identify optimal settings. More specifically, we evaluate different pre-processing components, e.g. tokenisers and stemmers, feature weighting schemes, e.g. TF and TFIDF, feature types, e.g. bigrams, trigrams and bigrams+trigrams, and classification algorithms, e.g. Support Vector Machines, Random Forest and Naive Bayes, against 6 publicly available datasets. The results demonstrate that optimal configurations are consistent across the 6 datasets while our UIMA-based pipeline yields a robust performance when compared to baseline methods.
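
The kind of configuration grid the abstract describes can be illustrated with a minimal sketch. This is not the authors' UIMA implementation: the whitespace tokeniser, the plain idf = log(N / df) variant, the two toy sentences and all function names (`ngrams`, `extract_features`, `weight`, `configurations`) are assumptions made for illustration, and the classifiers appear only as labels because no model is trained here.

```python
import math
from collections import Counter
from itertools import product


def ngrams(tokens, n):
    """Return the contiguous n-grams (as tuples) of a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def extract_features(tokens, feature_type):
    """Collect the n-grams covered by a feature-type name from the abstract."""
    sizes = {"bigrams": (2,), "trigrams": (3,), "bigrams+trigrams": (2, 3)}[feature_type]
    return [gram for n in sizes for gram in ngrams(tokens, n)]


def weight(docs_features, scheme):
    """Weight each document's features with raw TF or with TF-IDF."""
    n_docs = len(docs_features)
    # Document frequency: in how many documents each feature occurs.
    df = Counter(f for feats in docs_features for f in set(feats))
    weighted = []
    for feats in docs_features:
        tf = Counter(feats)
        if scheme == "TF":
            weighted.append(dict(tf))
        else:
            # Assumed variant: idf = log(N / df), with no smoothing.
            weighted.append({f: c * math.log(n_docs / df[f]) for f, c in tf.items()})
    return weighted


FEATURE_TYPES = ["bigrams", "trigrams", "bigrams+trigrams"]
WEIGHTINGS = ["TF", "TFIDF"]
CLASSIFIERS = ["SVM", "RandomForest", "NaiveBayes"]  # labels only, nothing is trained


def configurations():
    """Enumerate every (feature type, weighting, classifier) combination."""
    return list(product(FEATURE_TYPES, WEIGHTINGS, CLASSIFIERS))


# Toy input, tokenised on whitespace (an assumption; the paper compares tokenisers).
docs = ["the film was very good", "the film was very bad"]
tokenised = [d.split() for d in docs]

# Build the weighted feature vectors for each feature type and weighting scheme.
vectors = {
    (feature_type, scheme): weight(
        [extract_features(t, feature_type) for t in tokenised], scheme
    )
    for feature_type, scheme in product(FEATURE_TYPES, WEIGHTINGS)
}
```

In a full evaluation, each entry of `configurations()` would be trained and scored on every dataset so that the resulting accuracy and running time could be compared across combinations.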

Cite

Altrabsheh, N., Kontonatsios, G., & Korkontzelos, Y. (2019). Evaluating the Accuracy and Efficiency of Sentiment Analysis Pipelines with UIMA. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11608 LNCS, pp. 286–294). Springer Verlag. https://doi.org/10.1007/978-3-030-23281-8_23
