Sentiment classification on product reviews has become a popular topic in the research community. In this paper, we propose an approach to generating multi-unigram features to enhance a negation-aware Naive Bayes classifier for sentiment classification on sentences of product reviews. We coin the term "multi-unigram feature" to represent a new kind of features that are generated in our proposed algorithm with capturing high-frequently co-appeared unigram features in the training data. We further make the classifier aware of negation expressions in the training and classification process to eliminate the confusions of the classifier that is caused by negation expressions within sentences. Extensive experiments on a human-labeled data set not only qualitatively demonstrate good quality of the generated multi-unigram features but also quantitatively show that our proposed approach beats three baseline methods. Experiments on impact analysis of parameters illustrate that our proposed approach stably outperforms the baseline methods. © 2010 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Wei, W., Gulla, J. A., & Fu, Z. (2010). Advanced Intelligent Computing Theories and Applications. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 6215(2), 380–391. Retrieved from http://www.scopus.com/inward/record.url?eid=2-s2.0-77958492391&partnerID=tZOtx3y1
Mendeley helps you to discover research relevant for your work.