Sentiment lexicon learning is of paramount importance in sentiment analysis. One of the most considerable challenges in learning sentiment lexicons is their domain-specific behavior. Transferring knowledge acquired from a sentiment lexicon from one domain to another is an open research problem. In this study, we attempt to address this challenge by presenting a transfer learning approach that creates new learning insights for multiple domains of the same genre. We propose an unsupervised sentiment lexicon learning methodology scalable to new domains of the same genre. Incremental learning and the methodology learn polarity seed words from corpora of multiple automatically selected source domains. This process then transfers its genre-level knowledge of corpus-learned seed words to the target domains. The corpus-learned seed words are used for sentiment lexicon generation for multiple target domains of the same genre. The sentiment lexicon learning process is based on the latent semantic analysis technique and uses unlabeled training data from the source and target domains. The experiment was performed using 24 domains of the same genre, i.e., consumer product review. The proposed model displays the best results using standard evaluation measures compared with the competitive baselines. The proposed genre-based unsupervised approach achieves a maximum accuracy of 86% and outperforms methods recently presented in the literature.
CITATION STYLE
Sanagar, S., & Gupta, D. (2020). Unsupervised Genre-Based Multidomain Sentiment Lexicon Learning Using Corpus-Generated Polarity Seed Words. IEEE Access, 8, 118050–118071. https://doi.org/10.1109/ACCESS.2020.3005242
Mendeley helps you to discover research relevant for your work.