Caching with Delayed Hits

Nirav Atre; Justine Sherry; Weina Wang; Daniel S. Berger

Conference Proceedings

Caching with Delayed Hits

SIGCOMM 2020 - Proceedings of the 2020 Annual Conference of the ACM Special Interest Group on Data Communication on the Applications, Technologies, Architectures, and Protocols for Computer Communication (2020) 495-513

DOI: 10.1145/3387514.3405883

47Citations

51Readers

Get full text

Abstract

Caches are at the heart of latency-sensitive systems. In this paper, we identify a growing challenge for the design of latency-minimizing caches called delayed hits. Delayed hits occur at high throughput, when multiple requests to the same object queue up before an outstanding cache miss is resolved. This effect increases latencies beyond the predictions of traditional caching models and simulations; in fact, caching algorithms are designed as if delayed hits simply didn't exist. We show that traditional caching strategies-even so called 'optimal' algorithms-can fail to minimize latency in the presence of delayed hits. We design a new, latency-optimal offline caching algorithm called belatedly which reduces average latencies by up to 45% compared to the traditional, hit-rate optimal Belady's algorithm. Using belatedly as our guide, we show that incorporating an object's 'aggregate delay' into online caching heuristics can improve latencies for practical caching systems by up to 40%. We implement a prototype, Minimum-AggregateDelay (mad), within a CDN caching node. Using a CDN production trace and backends deployed in different geographic locations, we show that mad can reduce latencies by 12-18% depending on the backend RTTs.

Author supplied keywords

Cite

CITATION STYLE

APA

Atre, N., Sherry, J., Wang, W., & Berger, D. S. (2020). Caching with Delayed Hits. In SIGCOMM 2020 - Proceedings of the 2020 Annual Conference of the ACM Special Interest Group on Data Communication on the Applications, Technologies, Architectures, and Protocols for Computer Communication (pp. 495–513). Association for Computing Machinery. https://doi.org/10.1145/3387514.3405883

Caching with Delayed Hits

Abstract

Author supplied keywords

Cite

Register to see more suggestions