Abstract
The rapid growth in social networking services has led to the generation of a massive volume of opinionated information in the form of electronic text. As a result, the research on text sentiment analysis has drawn a great deal of interest. In this paper a novel feature weighting approach is proposed for the sentiment analysis of Twitter data. It properly measures the relative significance of each feature regarding both intra-category and intra-category distribution. A new statistical model called Category Discriminative Strength is introduced to characterize the discriminability of the features among various categories, and a modified Chi-square (χ2)-based measure is employed to measure the intra-category dependency of the features. Moreover, a fine-grained feature clustering strategy is proposed to maximize the accuracy of the analysis. Extensive experiments demonstrate that the proposed approach significantly outperforms four state-of-the-art sentiment analysis techniques in terms of accuracy, precision, recall, and F1 measure with various sizes and patterns of training and test datasets.
Author supplied keywords
Cite
CITATION STYLE
Wang, Y., & Youn, H. (2019). Feature weighting based on inter-category and intra-category strength for Twitter sentiment analysis. Applied Sciences (Switzerland), 9(1). https://doi.org/10.3390/app9010092
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.