Traditional methods for sentiment classification based on supervised learning require a large amount of labeled data for training. However, It is hard to obtain enough labeled data because it can be too expensive compared with unlabeled data. In this paper, we propose an identification of sentiment labels based on self-training (ISLS) method that can make full use of the large number of labeled data. We extract sentiment expressions based on sentiment seeds by self-training, learn sentiment words on unlabeled data and annotate unlabeled data. The sentiment expressions include processing and extracting for the negative meaning of the text. The ISLS method avoids the subjective problems of manual annotation. Experiments validate the effectiveness of the proposed ISLS method.
CITATION STYLE
Qu, Z., Wu, C., Wang, X., & Zhao, Y. (2018). Identification of sentiment labels based on self-training. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10943 LNCS, pp. 404–413). Springer Verlag. https://doi.org/10.1007/978-3-319-93803-5_38
Mendeley helps you to discover research relevant for your work.