In sentiment classification, labeled data is often limited while unlabeled data is ample. This motivates semi-supervised learning for sentiment classification to improve the performance by exploring the knowledge in unlabeled data. In this paper, we analyze the possibility and the difficulty of semisupervised sentiment classification and indicate that noisy features may be the main reason for badly influencing the performance. To overcome this problem, we propose a novel self-training approach where multiple feature subspace-based classifiers are utilized to explore a set of good features for better classification decision and to select the informative samples for automatically labeling. Evaluation over multiple data sets shows the effectiveness of our self-training approach for semi-supervised sentiment classification.
CITATION STYLE
Gao, W., Li, S., Xue, Y., Wang, M., & Zhou, G. (2014). Semi-supervised sentiment classification with self-training on feature subspaces. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8922, pp. 231–239). Springer Verlag. https://doi.org/10.1007/978-3-319-14331-6_23
Mendeley helps you to discover research relevant for your work.