An empirical evaluation of evaluation metrics of procedurally generated Mario levels

Julian R.H. Mariño; Willian M.P. Reis; Levi H.S. Lelis

Conference Proceedings, Open Access

Proceedings of the 11th AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, AIIDE 2015 (2015), Vol. 2015-November, pp. 44–50

DOI: 10.1609/aiide.v11i1.12785

Abstract

There are several approaches in the literature for automatically generating Infinite Mario Bros levels. The evaluation of such approaches is often performed solely with computational metrics such as leniency and linearity. While these metrics are important for an initial exploratory evaluation of the content generated, it is not clear whether they are able to capture the player's perception of the content generated. In this paper we evaluate several of the commonly used computational metrics. Namely, we perform a systematic user study with procedural content generation systems and compare the insights gained from our user study with those gained from analyzing the computational metric values. The results of our experiment suggest that current computational metrics should not be used in lieu of user studies for evaluating content generated by computer programs.

Cite

Mariño, J. R. H., Reis, W. M. P., & Lelis, L. H. S. (2015). An empirical evaluation of evaluation metrics of procedurally generated Mario levels. In Proceedings of the 11th AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, AIIDE 2015 (Vol. 2015-November, pp. 44–50). AAAI Press. https://doi.org/10.1609/aiide.v11i1.12785
