Improvements of the reactive auto scaling method for cloud platform

Dariusz Rafal Augustyn

Conference Proceedings

Improvements of the reactive auto scaling method for cloud platform

Augustyn D

Communications in Computer and Information Science (2017) 718 422-431

DOI: 10.1007/978-3-319-59767-6_33

3Citations

8Readers

Get full text

Abstract

Elements of cloud infrastructure like load balancers, instances of virtual server (service nodes), storage services are used in an architecture of modern cloud-enabled systems. Auto scaling is a mechanism which allows to on-line adapt efficiency of a system to current load. It is done by increasing or decreasing number of running instances. Auto scaling model uses a statistics based on a standard metrics like CPU Utilization or a custom metrics like execution time of selected business service. By horizontal scaling, the model should satisfy Quality of Service requirements (QoS). QoS requirements are determined by criteria based on statistics defined on metrics. The auto scaling model should minimize the cost (mainly measured by the number of used instances) subject to an assumed QoS requirements. There are many reactive (on current load) and predictive (future load) approaches to the model of auto scaling. In this paper we propose some extensions to the concrete reactive auto scaling model to improve sensitivity to load changes. We introduce the extension which varying threshold of CPU Utilization in scaling-out policy. We extend the model by introducing randomized method in scaling-in policy.

Author supplied keywords

Cite

CITATION STYLE

APA

Augustyn, D. R. (2017). Improvements of the reactive auto scaling method for cloud platform. In Communications in Computer and Information Science (Vol. 718, pp. 422–431). Springer Verlag. https://doi.org/10.1007/978-3-319-59767-6_33

Improvements of the reactive auto scaling method for cloud platform

Abstract

Author supplied keywords

Cite

Register to see more suggestions