Page 1
Recommender Systems for Social Bo...
Recommender Systems for Social Bookmarking PROEFSCHRIFT ter verkrijging van de graad van doctor aan de Universiteit van Tilburg, op gezag van de rector magnificus, prof. dr. Ph. Eijlander, in het openbaar te verdedigen ten overstaan van een door het college voor promoties aangewezen commissie in de aula van de Universiteit op dinsdag 8 december 2009 om 14.15 uur door Antonius Marinus Bogers, geboren op 21 september 1979 te Roosendaal en Nispen
Page 2
Promotor: Prof. dr. A.P.J. van den Bosch Beoordelingscommissie: Prof. dr. H.J. van den Herik Prof. dr. M. de Rijke Prof. dr. L. Boves Dr. B. Larsen Dr. J.J. Paijmans The research reported in this thesis has been funded by SenterNovem / the Dutch Ministry of Economic Affairs as part of the IOP-MMI �� Propos project. SIKS Dissertation Series No. 2009-42 The research reported in this thesis has been carried out under the auspices of SIKS, the Dutch Research School for Information and Knowledge Systems. TiCC Dissertation Series No. 10 ISBN 978-90-8559-582-3 Copyright c 2009, A.M. Bogers All rights reserved. No part of this publication may be reproduced, stored in a retrieval sys- tem, or transmitted, in any form or by any means, electronically, mechanically, photocopying, recording or otherwise, without prior permission of the author.
Page 3
��� The Web, they say, is leaving the era of search and entering one of discovery. What���s the difference? Search is what you do when you���re looking for some- thing. Discovery is when something wonderful that you didn���t know existed, or didn���t know how to ask for, finds you. ��� Jeffrey M. O���Brien
Page 4
iii
Page 5
PREFACE First and foremost I would like to thank my supervisor and promotor Antal van den Bosch, who guided me in my first steps as a researcher, both for my Master���s thesis and my Ph.D. research. Antal always gave me free reign in investigating many different research prob- lems, while at the same time managing to steer me in the right direction when the time called for it. Antal was always able to make time for me or any of the other Ph.D. students, and read and comment on paper or presentation drafts. In addition to turning me into a better researcher, Antal was also instrumental in improving my Guitar Hero skills. Our thesis meetings during your sabbatical doubled as a kind of Rock ���n Roll Fantasy Camp, where we could both unwind from discussing yet another batch of experiments I had run or was planning to run. Rock on! Antal also shares my passion for ice hockey. This resulted in us attending Tilburg Trappers games in Stappegoor as well as our regular discussions of the latest hockey news. Thanks for inviting me to come see the NHL All Star games in Breda. Hopefully we will meet again in spirit come May 2010 when the Canucks beat the Penguins in the Stanley Cup finals! The research presented in this thesis was performed in the context of the �� Propos project. I would like to acknowledge SenterNovem and the Dutch Ministry of Economic Affairs for funding this project as part of the IOP-MMI program. The �� Propos project was started by Lou Boves, Antal, and Frank Hofstede. I would like to thank Lou and Frank in particular. Frank was always able to look at my research problems from a different and more practical angle, and as a result our discussions were always very stimulating. I would also like to Mari Carmen Puerta-Melguizo, Anita Deshpande, and Els den Os, as well as the other members and attendees of the project meetings for the pleasant cooperation and helpful comments and suggestions. I wish to thank the members of my committee for taking time out of their busy schedules to read my dissertation and attending my defense: Jaap van den Herik, Maarten de Rijke, Lou Boves, Birger Larsen, and Hans Paijmans. Special thanks go to Jaap for his willingness to go through my thesis with a fine-grained comb. The readability of the final text has benefited greatly from his meticulous attention to detail and quality. Any errors remaining in the thesis are my own. I would also like to thank Birger for his comments, which helped to dot the i���s and cross the t���s of the final product. Finally, I would like to thank Hans Paijmans, who contributed considerably to my knowledge of IR. iv
Page 6
Preface v My Ph.D. years would not have been as enjoyable and successful without my colleagues at Tilburg University, especially those at the ILK group. It is not everywhere that the bond between colleagues is as strong as it was in ILK and I will not soon forget the coffee breaks with the Sulawesi Boys, the BBQs and Guitar Hero parties, lunch runs, after-work drinks, and the friendly and supportive atmosphere on the 3rd floor of Dante. I do not have enough room to thank everyone personally here, you know who you are. In your own way, you all contributed to this thesis. Over the course of my Ph.D. I have spent many Fridays at the Science Park in Amsterdam, working with members of the ILPS group headed by Maarten de Rijke. I would like to thank Erik Tjong Kim Sang for setting this up and Maarten for allowing me to become a guest researcher at his group. Much of what I know about doing IR research, I learned from these visits. From small things like visualizing research results and LaTeX layout to IR research methodology and a focus on empirical, task-driven research. I hope that some of what I have learned shows in the thesis. I would like to thank all of the ILPS members, but especially Krisztian, Katja, and Maarten for collaborating with me on expert search, which has proven to be a very fruitful collaboration so far. I have also had the pleasure of working at the Royal School of Library and Information Science in Copenhagen. I am most grateful to Birger Larsen and Peter Ingwersen, for helping to arrange my visit and guiding me around. Thanks are also due to Mette, Haakon, Charles, Jette, and the other members of the IIIA group for welcoming me and making me feel at home. Jeg glaeder mig til at arbejde sammen med jer snart. Thanks are due to Sunil Patel for designing part of the stylesheet of this thesis and to Jonathan Feinberg of http://www.wordle.net/ for the word cloud on the front of this thesis. I owe Maarten Clements a debt of gratitude for helping me to more efficiently im- plement his random walk algorithm. And of course thanks to BibSonomy, CiteULike, and Delicious for making the research described in this thesis possible. Finally, I would like to thank the three most important groups of people in my life. My friends, for always supporting me and taking my mind off my work. Thanks for all the din- ners, late-night movies, pool games, talks, vacations and trips we have had so far! Thanks to my parents for always supporting me and believing in me without you I would not have been where I am today. Kirstine, thanks for putting up with me while I was distracted by my work, and thanks for patiently reading and commenting on my Ph.D. thesis. Og tusind tak fordi du bringer s�� meget glaede, latter og kaerlighed ind i mit liv. Det her er til Timmy og Oinky!
Page 7
CONTENTS Preface iv 1 Introduction 1 1.1 Social Bookmarking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 1.3 Problem Statement and Research Questions . . . . . . . . . . . . . . . . . . . . . 3 1.4 Research Methodology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 1.5 Organization of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 1.6 Origins of the Material . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 2 Related Work 9 2.1 Recommender Systems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 2.1.1 Collaborative Filtering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 2.1.2 Content-based Filtering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13 2.1.3 Knowledge-based Recommendation . . . . . . . . . . . . . . . . . . . . . 14 2.1.4 Recommending Bookmarks & References . . . . . . . . . . . . . . . . . . 15 2.1.5 Recommendation in Context . . . . . . . . . . . . . . . . . . . . . . . . . . 17 2.2 Social Tagging . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 2.2.1 Indexing vs. Tagging . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22 2.2.2 Broad vs. Narrow Folksonomies . . . . . . . . . . . . . . . . . . . . . . . . 24 2.2.3 The Social Graph . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 2.3 Social Bookmarking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 2.3.1 Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27 2.3.2 Interacting with Social Bookmarking Websites . . . . . . . . . . . . . . . 28 2.3.3 Research tasks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29 I Recommending Bookmarks 3 Building Blocks for the Experiments 35 3.1 Recommender Tasks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35 3.2 Data Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37 3.2.1 CiteULike . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41 3.2.2 BibSonomy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42 vi
Page 8
Contents vii 3.2.3 Delicious . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44 3.3 Data Representation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46 3.4 Experimental Setup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47 3.4.1 Filtering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48 3.4.2 Evaluation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50 3.4.3 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52 4 Folksonomic Recommendation 55 4.1 Preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56 4.2 Popularity-based Recommendation . . . . . . . . . . . . . . . . . . . . . . . . . . 58 4.3 Collaborative Filtering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60 4.3.1 Algorithm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60 4.3.2 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64 4.3.3 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64 4.4 Tag-based Collaborative Filtering . . . . . . . . . . . . . . . . . . . . . . . . . . . 66 4.4.1 Tag Overlap Similarity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66 4.4.2 Tagging Intensity Similarity . . . . . . . . . . . . . . . . . . . . . . . . . . 68 4.4.3 Similarity Fusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68 4.4.4 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70 4.4.5 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72 4.5 Related work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74 4.6 Comparison to Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76 4.6.1 Tag-aware Fusion of Collaborative Filtering Algorithms . . . . . . . . . 77 4.6.2 A Random Walk on the Social Graph . . . . . . . . . . . . . . . . . . . . . 78 4.6.3 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80 4.6.4 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81 4.7 Chapter Conclusions and Answer to RQ 1 . . . . . . . . . . . . . . . . . . . . . . 82 5 Exploiting Metadata for Recommendation 85 5.1 Contextual Metadata in Social Bookmarking . . . . . . . . . . . . . . . . . . . . 86 5.2 Exploiting Metadata for Item Recommendation . . . . . . . . . . . . . . . . . . . 88 5.2.1 Content-based Filtering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88 5.2.2 Hybrid Filtering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91 5.2.3 Similarity Matching . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93 5.2.4 Selecting Metadata Fields for Recommendation Runs . . . . . . . . . . 94 5.3 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95 5.3.1 Content-based Filtering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95 5.3.2 Hybrid Filtering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97 5.3.3 Comparison to Folksonomic Recommendation . . . . . . . . . . . . . . . 98 5.4 Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 5.4.1 Content-based Filtering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 5.4.2 Hybrid Filtering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101 5.5 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102 5.6 Chapter Conclusions and Answer to RQ 2 . . . . . . . . . . . . . . . . . . . . . . 105 6 Combining Recommendations 107 6.1 Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108 6.1.1 Fusing Recommendations . . . . . . . . . . . . . . . . . . . . . . . . . . . 108
Page 9
Contents viii 6.1.2 Data Fusion in Machine Learning and IR . . . . . . . . . . . . . . . . . . 110 6.1.3 Why Does Fusion Work? . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111 6.2 Fusing Recommendations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112 6.3 Selecting Runs for Fusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114 6.4 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115 6.4.1 Fusion Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117 6.4.2 Comparing All Fusion Methods . . . . . . . . . . . . . . . . . . . . . . . . 119 6.5 Discussion & Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120 6.6 Chapter Conclusions and Answer to RQ 3 . . . . . . . . . . . . . . . . . . . . . . 121 II Growing Pains: Real-world Issues in Social Bookmarking 7 Spam 125 7.1 Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126 7.2 Methodology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128 7.2.1 Data Collection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129 7.2.2 Data Representation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130 7.2.3 Evaluation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132 7.3 Spam Detection for Social Bookmarking . . . . . . . . . . . . . . . . . . . . . . . 132 7.3.1 Language Models for Spam Detection . . . . . . . . . . . . . . . . . . . . 133 7.3.2 Spam Classification . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 135 7.3.3 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 136 7.3.4 Discussion and Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . 138 7.4 The Influence of Spam on Recommendation . . . . . . . . . . . . . . . . . . . . . 140 7.4.1 Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 140 7.4.2 Experimental Setup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 141 7.4.3 Results and Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 142 7.5 Chapter Conclusions and Answer to RQ 4 . . . . . . . . . . . . . . . . . . . . . . 145 8 Duplicates 147 8.1 Duplicates in CiteULike . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 148 8.2 Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 149 8.3 Duplicate Detection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 151 8.3.1 Creating a Training Set . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 151 8.3.2 Constructing a Duplicate Item Classifier . . . . . . . . . . . . . . . . . . . 153 8.3.3 Results and Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157 8.4 The Influence of Duplicates on Recommendation . . . . . . . . . . . . . . . . . . 160 8.4.1 Experimental Setup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 160 8.4.2 Results and Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 162 8.5 Chapter Conclusions and Answer to RQ 5 . . . . . . . . . . . . . . . . . . . . . . 164 III Conclusion 9 Discussion and Conclusions 169 9.1 Answers to Research Questions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 169 9.2 Recommendations for Recommendation . . . . . . . . . . . . . . . . . . . . . . . 172
Page 10
Contents ix 9.3 Summary of Contributions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 173 9.4 Future Directions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 174 References 177 Appendices A Collecting the CiteULike Data Set 191 A.1 Extending the Public Data Dump . . . . . . . . . . . . . . . . . . . . . . . . . . . . 191 A.2 Spam Annotation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 193 B Glossary of Recommendation Runs 195 C Optimal Fusion Weights 197 D Duplicate Annotation in CiteULike 203 List of Figures 205 List of Tables 207 List of Abbreviations 209 Summary 211 Samenvatting 215 Curriculum Vitae 219 Publications 221 SIKS Dissertation Series 223 TiCC Dissertation Series 229
Page 11
C HAPTER 1 INTRODUCTION For the past two decades, the World Wide Web has expanded at enormous rate. The first generation of the World Wide Web (WWW) enabled users to have instantaneous access to a large diversity of knowledge items. The second generation of the WWW is usually denoted by Web 2.0. It signifies a fundamental change in the way people interact with and through the World Wide Web. Web 2.0 is also referred to as the participatory Web. It can be characterized as a paradigm that facilitates communication, interoperability, user-centered design, and information sharing and collaboration on the Web (O���Reilly, 2005 Sharma, 2008). Moreover, in the transition to Web 2.0 we see a paradigm shift from local and solitary to global and collaborative. Also, this shift coincides with a shift from accessing and creating information to understanding information and understanding the people who deal with this information. Instead of creating, storing, managing, and accessing information on only one specific computer or browser, information management and access has been moving to many distributed places on the Web. Collaboratively created websites such as Wikipedia are edited and accessed by anyone, and users can document and share any aspect of their lives online using blogs, social networking sites, and video and photo sharing sites. This thesis deals with recommender systems, social tagging, and social bookmarking. What are the relations between these three elements, and can we build recommender systems that profit from the presence of the other two elements? Assuming that we can, what are the threats from the outside or inside of this new part of the WWW? In the thesis we deal with spam as the outside threat, and duplicates as the inside threat. The aim of the thesis is to understand the symbiosis of recommender systems, social tagging, and social bookmarking, and to design mechanisms that successfully counter the threats from the outside and from the inside. The course of this chapter is as follows. We introduce social bookmarking in Section 1.1. It is followed by a description of the scope of the thesis. The problem statement and five research questions are formulated in Section 1.3. Section 1.4 describes the research methodology. The structure of the thesis is provided in Section 1.5. Finally, Section 1.6 points to the origins of the material. 1
Page 12
Chapter 1. Introduction 2 1.1 Social Bookmarking Social bookmarking is a rather new phenomenon: instead of keeping a local copy of point- ers to favorite URLs, users can instead store and access their bookmarks online through a Web interface. The underlying application then makes all stored information shareable among users. Closely related to social bookmarking websites are the so-called social refer- ence managers, which follow the same principle, but with a focus on the online management and access of scientific articles and papers. Social bookmarking websites have seen a rapid growth in popularity and a high degree of activity by their users. For instance, Delicious1 is one of the most popular social bookmarking services. It received an average of 140,000 posts per day in 2008 according to the independently sampled data collected by Philipp Keller2. In addition to the aforementioned functionality, most social ���storage��� services also offer the user the opportunity to describe by keywords the content they added to their per- sonal profile. These keywords are commonly referred to as tags. They are an addition to e.g., the title and summary metadata commonly used to annotate content, and to improve the access and retrievability of a user���s own bookmarked Web pages. These tags are then made available to all users, many of whom have annotated many of the same Web pages with possibly overlapping tags. This results in a rich network of users, bookmarks, and tags, commonly referred to as a folksonomy. This social tagging phenomenon and the resulting folksonomies have become a staple of many Web 2.0 websites and services (Golder and Huberman, 2006). The emerging folksonomy on a social bookmarking website can be used to enhance a variety of tasks, such as searching for specific content. It can also enable the active discovery of new content by allowing users to browse through the richly connected network. A user could select one of his3 tags to explore all bookmarks annotated with that tag by the other users in the network, or locate like-minded users by examining a list of all other users who added a particular bookmark, possibly resulting in serendipitously discovered content (Marlow et al., 2006). Both browsing and searching the system, however, require active user participation to locate new and interesting content. As the system increases in popularity and more users as well as content enter the system, the access methods become less effective at finding all the interesting content present in the system. The information overload problem caused by this growing influx of users and content means that search and browsing, which require active participation, are not always the most practical or preferable ways of locating new and interesting content. Typically, users only have a limited amount of time to go through the search results. Assuming users know about the existence of the relevant content and know how to formulate the appropriate queries they may arrive in time at the preferred places. But what happens when the search and browse process becomes less effective? And what if the user does not know about all relevant content available in the system? Our interest, and the focus of this thesis, lies in using recommender systems to help the user with this information overload problem, and automatically find interesting content for the user. A recommender system is a type of personalized information filtering technology used to identify sets of items that are likely to be of interest to a certain user, using a variety of 1http://www.delicious.com/ 2Available at http://deli.ckoma.net/stats last visited January 2009. 3In this thesis, we use ���his��� and ���he��� to refer to both genders.
Page 13
Chapter 1. Introduction 3 information sources related to both the user and the content items (Resnick and Varian, 1997). 1.2 Scope of the Thesis In this thesis, we investigate how recommender systems can be applied to the domain of social bookmarking. More specifically, we want to investigate the task of item recommenda- tion. For this purpose, interesting and relevant items���bookmarks or scientific articles���are retrieved and recommended to the user. Recommendations can be based on a variety of information sources about the user and the items. It is a difficult task as we are trying to predict which items out of a very large pool would be relevant given a user���s interests, as represented by the items which the user has added in the past. In our experiments we dis- tinguish between two types of information sources. The first one is usage data contained in the folksonomy, which represents the past selections and transactions of all users, i.e., who added which items, and with what tags. The second information source is the metadata describing the bookmarks or articles on a social bookmarking website, such as title, de- scription, authorship, tags, and temporal and publication-related metadata. We are among the first to investigate this content-based aspect of recommendation for social bookmark- ing websites. We compare and combine the content-based aspect with the more common usage-based approaches. Because of the novelty of applying recommender systems to social bookmarking websites, there is not a large body of related work, results, and design principles to build on. We therefore take a system-based approach for the evaluation our work. We try to simulate, as realistically as possible, the reaction of the user to different variants of the recommenda- tion algorithms in a controlled laboratory setting. We focus on two specific domains: (1) recommending bookmarks of Web pages and (2) recommending bookmarked references to scientific articles. It is important to remark, however, that a system-based evaluation can only provide us with a provisional estimate of how well our algorithms are doing. User sat- isfaction is influenced by more than just recommendation accuracy (Herlocker et al., 2004) and it would be essential to follow up our work with an evaluation on real users in realistic situations. However, this is not the focus of the thesis, nor will we focus on tasks such as tag recommendation or finding like-minded users. We focus strictly on recommending items. 1.3 Problem Statement and Research Questions As stated above, the rich information contained in social bookmarking websites can be used to support a variety of tasks. We consider three important ones: browsing, search, and recommendation. From these three, we focus on (item) recommendation in this thesis. In this context we may identify two types of key characteristics of social bookmarking websites that can be used in the recommendation process. We remark that the information sources represented by these characteristics are not always simultaneously available in every rec- ommendation scenario. The resulting recommendations are produced by (1) collaborative
Page 14
Chapter 1. Introduction 4 filtering algorithms and (2) content-based filtering algorithms. We briefly discuss both types of algorithms and the associated characteristics below. Collaborative filtering algorithms Much of the research in recommender systems has fo- cused on exploiting sets of usage patterns that represent user preferences and transac- tions. The class of algorithms that operate on this source of information are called Col- laborative Filtering (CF) algorithms. They automate the process of ���word-of-mouth��� recommendation: items are recommended to a user based on how like-minded users rated those items (Goldberg et al., 1992 Shardanand and Maes, 1995). In the so- cial bookmarking domain, we have an extra layer of usage data at our disposal in the folksonomy in the form of tags. This extra layer of collaboratively generated tags binds the users and items of a system together in yet another way, opening up many possibilities for new algorithms that can take advantage of this data. Content-based filtering algorithms Social bookmarking services and especially social ref- erence managers are also characterized by the rich metadata describing the content added by their users. Recommendation on the basis of textual information is com- monly referred to as content-based filtering (Goldberg et al., 1992) and matches the item metadata against a representation of the user���s interest to produce new recom- mendations. The metadata available on social bookmarking services describe many different aspects of the items posted to the website. It may comprise both personal information, such as reviews and descriptions, as well as general metadata that is the same for all users. While the availability of metadata is not unique to social bookmarking���movie recommenders, for instance, also have a rich set of metadata at their disposal (Paulson and Tzanavari, 2003)���it might be an important information source for generating item recommendations. Having distinguished the two types of characteristics of social bookmarking websites, we are now able to formulate our problem statement (PS). PS How can the characteristics of social bookmarking websites be exploited to produce the best possible item recommendations for users? To address this problem statement, we formulate five research questions. The first two research questions belong together. They read as follows. RQ 1 How can we use the information represented by the folksonomy to sup- port and improve the recommendation performance? RQ 2 How can we use the item metadata available in social bookmarking sys- tems to provide accurate recommendations to users? After answering the first two questions, we know how to exploit in the best manner the two types of information sources���the folksonomy and item metadata���to produce accurate recommendations. This leads us to our third research question.
Page 15
Chapter 1. Introduction 5 RQ 3 Can we improve performance by combining the recommendations gen- erated by different algorithms? These are the three main research questions. As mentioned earlier, we evaluate our answers to these questions by simulating the user���s interaction with our proposed recommendation algorithms in a laboratory setting. However, such an idealized perspective does not take into account the dynamic growth issues caused by the increasing popularity of social book- marking websites. Therefore, we focus on two of these growing pains. There is one pain attacking social bookmarking websites from the outside, spam. The other one, duplicate content, attacks a social bookmarking website from the inside. They lead to our final two research questions. RQ 4 How big a problem is spam for social bookmarking services? RQ 5 How big a problem is the entry of duplicate content for social bookmark- ing services? Wherever it is applicable and aids our investigation, we will break down these questions into separate and even more specific research questions. 1.4 Research Methodology The research methodology followed in the thesis comprises five parts: (1) reviewing the literature, (2) analyzing the findings, (3) designing the recommendation algorithms, (4) evaluating the algorithms, and (5) designing protection mechanisms for two growing pains. First, we conduct a literature review to identify the main techniques, characteristics, and issues in the fields of recommender systems, social tagging, and social bookmarking, and in the intersection of the three fields. In addition, Chapters 4 through 8 each contain short literature reviews specifically related to the work described in the respective chapters. Second, we analyze the findings from the literature. We use these in the third part of our methodology to guide us in the development of recommendation algorithms specifically suited for item recommendation on social bookmarking websites. Fourth, we evaluate our recommendation algorithms in a quantitative manner. The build- ing blocks of our quantitative evaluation are described in more detail in Chapter 3. Our quantitative evaluation is based on a so-called backtesting approach to evaluation that is common in recommender systems literature (Breese et al., 1998 Herlocker et al., 2004 Baluja et al., 2008). In backtesting, we evaluate on a per-user basis. We withhold ran- domly selected items from each user profile, and generate recommendations by using the remaining data as training material. If a user���s withheld items are predicted at the top of the ranked list of recommendations, then the algorithm is considered to perform well for that user. The performance of a recommendation algorithm is averaged over the performance for all individual users. In our evaluation we employ cross-validation, which can provide a