Tag me a label with multi-arm: Active learning for Telugu sentiment analysis

Sandeep Sricharan Mukku; Subba Reddy Oota; Radhika Mamidi

Conference Proceedings

Tag me a label with multi-arm: Active learning for Telugu sentiment analysis

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10440 LNCS 355-367

DOI: 10.1007/978-3-319-64283-3_26

6Citations

8Readers

Get full text

Abstract

Sentiment Analysis is one of the most active research areas in natural language processing and an extensively studied problem in data mining, web mining and text mining for English language. With the proliferation of social media these days, data is widely increasing in regional languages along with English. Telugu is one such regional language with abundant data available in social media, but it’s hard to find a labeled training set as human annotation is time-consuming and cost-ineffective. To address this issue, in this paper the practicality of active learning for Telugu sentiment analysis is investigated. We built a hybrid approach by combining different query selection strategy frameworks to increase more accurate training data instances with limited labeled data. Using a set of classifiers like SVM, XGBoost, and Gradient Boosted Trees (GBT), we achieved promising results with minimal error rate.

Author supplied keywords

Cite

CITATION STYLE

APA

Mukku, S. S., Oota, S. R., & Mamidi, R. (2017). Tag me a label with multi-arm: Active learning for Telugu sentiment analysis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10440 LNCS, pp. 355–367). Springer Verlag. https://doi.org/10.1007/978-3-319-64283-3_26

Tag me a label with multi-arm: Active learning for Telugu sentiment analysis

Abstract

Author supplied keywords

Cite

Register to see more suggestions