Latency-aware dynamic server and cooling capacity provisioner for data centers

2Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.

Abstract

Data center operators generally overprovision IT and cooling capacities to address unexpected utilization increases that can violate service quality commitments. This results in energy wastage. To reduce this wastage, we introduce HCP (Holistic Capacity Provisioner), a service latency aware management system for dynamically provisioning the server and cooling capacity. Short-term load prediction is used to adjust the online server capacity to concentrate the workload onto the smallest possible set of online servers. Idling servers are completely turned off based on a separate long-term utilization predictor. HCP targets data centers that use chilled air cooling and varies the cooling provided commensurately, using adjustable aperture tiles and speed control of the blower fans in the air handler. An HCP prototype supporting a server heterogeneity is evaluated with real-world workload traces/requests and realizes up to 32% total energy savings while limiting the 99th-percentile and average latency increases to at most 6.67% and 3.24%, respectively, against a baseline system where all servers are kept online.

Cite

CITATION STYLE

APA

Desu, A., Puvvadi, U., Stachecki, T., Vishwakarma, S., Khalili, S., Ghose, K., & Sammakia, B. G. (2021). Latency-aware dynamic server and cooling capacity provisioner for data centers. In SoCC 2021 - Proceedings of the 2021 ACM Symposium on Cloud Computing (pp. 335–349). Association for Computing Machinery, Inc. https://doi.org/10.1145/3472883.3487015

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free