As data sparsity may produce unreliable recommendations in collaborative filtering-based recommender systems, it has been addressed by many researchers in related fields. Jaccard index is regarded as effective when combined with existing similarity measures to relieve data sparsity problem. However, the index only reflects how many items are co-rated by two users, without considering whether their ratings are evaluated similar or not. This paper proposes a novel improvement of Jaccard index, reflecting not only the ratio of co-rated items but also whether the ratings of each co-rated item by two users are both high, medium, or low. A genetic algorithm is employed to find the optimal weights of the levels of evaluations and the optimal boundaries between them. We conducted extensive experiments to find that the proposed index significantly outperforms Jaccard index on moderately sparse to dense datasets, in terms of both prediction and recommendation qualities.
CITATION STYLE
Lee, S. (2017). Improving Jaccard index using genetic algorithms for collaborative filtering. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10385 LNCS, pp. 378–385). Springer Verlag. https://doi.org/10.1007/978-3-319-61824-1_41
Mendeley helps you to discover research relevant for your work.