Saec: Similarity-aware embedding compression in recommendation systems

9Citations
Citations of this article
25Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Embedding methods are commonly used in recommender systems to represent features about user and item. An impeding practical challenge is that the large number of embedding vectors incurs substantial memory footprint for serving queries, especially as the number of features continues to grow. We propose an embedding compression system called Saec to address this challenge. Saec exploits the similarity among features within a field as they represent the same attribute of user or item, and uses clustering to compress the embeddings. We propose a new fast clustering method that relies on the empirical heavy-tailed nature of features to drastically reduce the clustering overhead. We implement a prototype of Saec on a production system and evaluate it with private feature datasets from a large Internet company. Testbed experiments show that Saec consistently reduces the memory footprint of embedding vectors from 4.46 GB to 161 MB. The fast clustering method we design achieves 32x speedup compared to the baseline method.

Cite

CITATION STYLE

APA

Wu, X., Xu, H., Zhang, H., Chen, H., & Wang, J. (2020). Saec: Similarity-aware embedding compression in recommendation systems. In APSys 2020 - Proceedings of the 2020 ACM SIGOPS Asia-Pacific Workshop on Systems (pp. 82–89). Association for Computing Machinery. https://doi.org/10.1145/3409963.3410498

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free