Reducing tail latency of DNN-based recommender systems using in-storage processing

Minsub Kim; Sungjin Lee

Conference ProceedingsOPEN ACCESS

Reducing tail latency of DNN-based recommender systems using in-storage processing

APSys 2020 - Proceedings of the 2020 ACM SIGOPS Asia-Pacific Workshop on Systems (2020) 90-97

DOI: 10.1145/3409963.3410501

16Citations

16Readers

Get full text

Abstract

Most recommender systems are designed to comply with service level agreement (SLA) because prompt response to users' requests is the most important factor that decides the quality of service. Existing recommender systems, however, seriously suffer from long tail latency when the embedding tables cannot be entirely loaded in the main memory. In this paper, we propose a new SSD architecture called EMB-SSD, which mitigates the tail latency problem of recommender systems by leveraging in-storage processing. By offloading the data-intensive parts of the recommendation algorithm into an SSD, EMB-SSD not only reduces the data traffic between the host and the SSD, but also lowers software overheads caused by deep I/O stacks. Results show that EMB-SSD exhibits 47% and 25% shorter 99th percentile latency and average latency, respectively, over existing systems.

Author supplied keywords

Cite

CITATION STYLE

APA

Kim, M., & Lee, S. (2020). Reducing tail latency of DNN-based recommender systems using in-storage processing. In APSys 2020 - Proceedings of the 2020 ACM SIGOPS Asia-Pacific Workshop on Systems (pp. 90–97). Association for Computing Machinery. https://doi.org/10.1145/3409963.3410501

Reducing tail latency of DNN-based recommender systems using in-storage processing

Abstract

Author supplied keywords

Cite

Register to see more suggestions