Improving movie gross prediction through news analysis

  • Wenbin Zhang
  • Steven Skiena

Traditional movie gross predictions are based on numerical and categorical movie data from the Internet Movie Database (IMDB). In this paper, we use quantitative news data generated by Lydia, our system for large-scale news analysis, to help predict movie grosses. Comparing two kinds of models (regression and k-nearest neighbor), we find that models using only news data perform similarly to those using IMDB data. Moreover, combining IMDB data with news data yields better performance, and the improvement is statistically significant.
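
As a rough illustration of the k-nearest neighbor idea mentioned in the abstract, the sketch below predicts a movie's gross as the average gross of its k most similar movies. It is not the authors' implementation: the abstract does not state the features, distance metric, or value of k that the paper uses, so the function name `knn_predict`, the two features (one IMDB-style, one news-style), the Euclidean distance, and every number in `toy_movies` are assumptions made up purely for illustration.

```python
import math


def knn_predict(train, query, k=3):
    """Predict a gross as the mean gross of the k nearest training movies.

    train: list of (feature_vector, gross) pairs (hypothetical layout)
    query: feature_vector of the movie to predict
    """
    # Rank training movies by Euclidean distance to the query and keep k.
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    # Average the grosses of the selected neighbors.
    return sum(gross for _, gross in nearest) / len(nearest)


# Invented toy rows, not data from the paper:
# ([budget in $M, news mentions before release], gross in $M)
toy_movies = [
    ([10.0, 50.0], 20.0),
    ([12.0, 60.0], 30.0),
    ([100.0, 900.0], 250.0),
    ([90.0, 800.0], 200.0),
]

predicted = knn_predict(toy_movies, [11.0, 55.0], k=2)
print(predicted)
```

In practice the features would likely need scaling before distances are computed, since raw news counts and budgets are on different scales; that step is omitted here to keep the sketch short.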
