Enhanced bootstrapping algorithm for automatic annotation of tweets

5Citations
Citations of this article
16Readers
Mendeley users who have this article in their library.

Abstract

Annotations are critical in various text mining tasks such as opinion mining, sentiment analysis, word sense disambiguation. Supervised learning algorithms start with the training of the classifier and require manually annotated datasets. However, manual annotations are often subjective, biased, onerous, and burdensome to develop; therefore, there is a need for automatic annotation. Automatic annotators automatically annotate the data for creating the training set for the supervised classifier, but lack subjectivity and ignore semantics of underlying textual structures. The objective of this research is to develop scalable and semantically rich automatic annotation system while incorporating domain dependent characteristics of the annotation process. The authors devised an enhanced bootstrapping algorithm for the automatic annotation of Tweets and employed distributional semantic models (LSA and Word2Vec) to augment the novel Bootstrapping algorithm and tested the proposed algorithm on the 12,000 crowd-sourced annotated Tweets and achieved a 68.56% accuracy which is higher than the baseline accuracy.

Cite

CITATION STYLE

APA

Mohd, M., Jan, R., & Hakak, N. (2020). Enhanced bootstrapping algorithm for automatic annotation of tweets. International Journal of Cognitive Informatics and Natural Intelligence, 14(2), 35–60. https://doi.org/10.4018/IJCINI.2020040103

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free