Predicting Movie Audience with Stacked Generalization by Combining Machine Learning Algorithms

5Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.

Abstract

The Korea film industry has matured and the number of movie-watching per capita has reached the highest level in the world. Since then, movie industry growth rate is decreasing and even the total sales of movies per year slightly decreased in 2018. The number of moviegoers is the first factor of sales in movie industry and also an important factor influencing additional sales. Thus it is important to predict the number of movie audiences. In this study, we predict the cumulative number of audiences of films using stacking, an ensemble method. Stacking is a kind of ensemble method that combines all the algorithms used in the prediction. We use box office data from Korea Film Council and web comment data from Daum Movie (www.movie.daum.net). This paper describes the process of collecting and preprocessing of explanatory variables and explains regression models used in stacking. Final stacking model outperforms in the prediction of test set in terms of RMSE.

Cite

CITATION STYLE

APA

Park, J., & Lim, C. (2021). Predicting Movie Audience with Stacked Generalization by Combining Machine Learning Algorithms. Communications for Statistical Applications and Methods, 28(3), 217–232. https://doi.org/10.29220/CSAM.2021.28.3.217

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free