Comparative recommender system evaluation

  • Said A
  • Bellogín A

Abstract

Recommender systems research is often based on comparisons of predictive accuracy: the better the evaluation scores, the better the recommender. However, it is difficult to compare results from different recommender systems due to the many options in design and implementation of an evaluation strategy. Additionally, algorithmic implementations can diverge from the standard formulation due to manual tuning and modifications that work better in some situations. In this work we compare common recommendation algorithms as implemented in three popular recommendation frameworks. To provide a fair comparison, we have complete control of the evaluation dimensions being benchmarked: dataset, data splitting, evaluation strategies, and metrics. We also include results using the internal evaluation mechanisms of these frameworks. Our analysis points to large differences in recommendation accuracy across frameworks and strategies, i.e., the same baselines may perform orders of magnitude better or worse across frameworks. Our results show the necessity of clear guidelines when reporting evaluation of recommender systems to ensure reproducibility and comparison of results.
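The abstract's core point is that evaluation results only become comparable when the dataset, data split, candidate-item strategy, and metric are fixed across systems. The sketch below (not the authors' code; the per-user split, popularity baseline, and precision@10 are illustrative assumptions) shows what such a controlled pipeline looks like: as long as the split and metric stay identical, any recommender can be swapped into the `recommend` slot and scored under the same conditions.

```python
# Minimal sketch of a controlled evaluation: fixed split, fixed metric,
# pluggable recommender. All names and thresholds here are illustrative.
import random
from collections import defaultdict


def split_per_user(ratings, test_ratio=0.2, seed=42):
    """Deterministic per-user split: the same seed yields the same split
    regardless of which framework consumes the resulting partitions."""
    rng = random.Random(seed)
    by_user = defaultdict(list)
    for user, item, rating in ratings:
        by_user[user].append((user, item, rating))
    train, test = [], []
    for rows in by_user.values():
        rng.shuffle(rows)
        cut = max(1, int(len(rows) * (1 - test_ratio)))
        train.extend(rows[:cut])
        test.extend(rows[cut:])
    return train, test


def popularity_recommender(train):
    """Simple baseline: rank items by training popularity, skip seen items."""
    counts, seen = defaultdict(int), defaultdict(set)
    for user, item, _ in train:
        counts[item] += 1
        seen[user].add(item)
    ranked = sorted(counts, key=counts.get, reverse=True)

    def recommend(user, k):
        return [i for i in ranked if i not in seen[user]][:k]

    return recommend


def precision_at_k(recommend, test, k=10, relevance_threshold=4.0):
    """Precision@k averaged over users with at least one relevant test item."""
    relevant = defaultdict(set)
    for user, item, rating in test:
        if rating >= relevance_threshold:
            relevant[user].add(item)
    scores = []
    for user, items in relevant.items():
        recs = recommend(user, k)
        scores.append(sum(1 for i in recs if i in items) / k)
    return sum(scores) / len(scores) if scores else 0.0


if __name__ == "__main__":
    # Tiny synthetic interaction log: (user, item, rating).
    rng = random.Random(0)
    ratings = [(u, rng.randrange(50), float(rng.randint(1, 5)))
               for u in range(20) for _ in range(15)]
    train, test = split_per_user(ratings)
    rec = popularity_recommender(train)
    print("precision@10 =", round(precision_at_k(rec, test, k=10), 4))
```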

Citation (APA)

Said, A., & Bellogín, A. (2014). Comparative recommender system evaluation (pp. 129–136). Association for Computing Machinery (ACM). https://doi.org/10.1145/2645710.2645746
