SimClusters: Community-Based Representations for Heterogeneous Recommendations at Twitter

Abstract

Personalized recommendation products at Twitter target a multitude of heterogeneous items: Tweets, Events, Topics, Hashtags, and users. These targets vary in their cardinality (which affects the scale of the problem) and their "shelf life" (which constrains the latency of generating the recommendations). Although Twitter has built a variety of recommendation systems over the past decade, the broader problem was mostly tackled piecemeal. In this paper, we present SimClusters, a general-purpose representation layer based on overlapping communities into which users as well as heterogeneous content can be captured as sparse, interpretable vectors to support a multitude of recommendation tasks. We propose a novel algorithm for community discovery based on Metropolis-Hastings sampling, which is both more accurate and significantly faster than off-the-shelf alternatives. SimClusters scales to networks with billions of users and has been effective across a variety of deployed applications at Twitter.

Cite

Satuluri, V., Wu, Y., Zheng, X., Qian, Y., Wichers, B., Dai, Q., … Lin, J. (2020). SimClusters: Community-Based Representations for Heterogeneous Recommendations at Twitter. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 3183–3193). Association for Computing Machinery. https://doi.org/10.1145/3394486.3403370
