Text mining and sentiment analysis for predicting box office success

18Citations
Citations of this article
68Readers
Mendeley users who have this article in their library.

Abstract

After emerging online communications, text mining and sentiment analysis has been frequently applied into analyzing electronic word-of-mouth. This study aims to develop a domain-specific lexicon of sentiment analysis to predict box office success in Korea film market and validate the feasibility of the lexicon. Natural language processing, a machine learning algorithm, and a lexicon-based sentiment classification method are employed. To create a movie domain sentiment lexicon, 233,631 reviews of 147 movies with popularity ratings is collected by a XML crawling package in R program. We accomplished 81.69% accuracy in sentiment classification by the Korean sentiment dictionary including 706 negative words and 617 positive words. The result showed a stronger positive relationship with box office success and consumers’ sentiment as well as a significant positive effect in the linear regression for the predicting model. In addition, it reveals emotion in the user-generated content can be a more accurate clue to predict business success.

Cite

CITATION STYLE

APA

Kim, Y., Kang, M., & Jeong, S. R. (2018). Text mining and sentiment analysis for predicting box office success. KSII Transactions on Internet and Information Systems, 12(8), 4090–4102. https://doi.org/10.3837/tiis.2018.08.030

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free