Neural personalized recommendation models are used across a wide variety of datacenter applications including search, social media, and entertainment. State-of-the-art models comprise large embedding tables that have billions of parameters requiring large memory capacities. Unfortunately, large and fast DRAM-based memories levy high infrastructure costs. Conventional SSD-based storage solutions offer an order of magnitude larger capacity, but have worse read latency and bandwidth, degrading inference performance. RecSSD is a near data processing based SSD memory system customized for neural recommendation inference that reduces end-to-end model inference latency by 2× compared to using COTS SSDs across eight industry-representative models.
CITATION STYLE
Wilkening, M., Gupta, U., Hsia, S., Trippel, C., Wu, C. J., Brooks, D., & Wei, G. Y. (2021). RecSSD: Near data processing for solid state drive based recommendation inference. In International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS (pp. 717–729). Association for Computing Machinery. https://doi.org/10.1145/3445814.3446763
Mendeley helps you to discover research relevant for your work.