SCARL: Attentive reinforcement learning-based scheduling in a multi-resource heterogeneous cluster

Mukoe Cheong; Hyunsung Lee; Ikjun Yeom; Honguk Woo

Journal Article, Open Access

IEEE Access (2019) 7 153432-153444

DOI: 10.1109/ACCESS.2019.2948150

25 Citations

21 Readers

Abstract

Advanced reinforcement learning (RL) technologies have recently increased the opportunity for automating several tasks in cluster management at scale by exploiting repetitive logs of cluster operation and building a learning model for resource allocation and job scheduling. Yet, this trend of adopting RL in the domain of cluster management has not fully addressed the diversity and heterogeneity of jobs and machines in modern cluster environments. In this paper, we present an RL-based scheduler for a multi-resource cluster, namely SCARL (SCheduler with Attentive Reinforcement Learning), concentrating on intricate cluster operating conditions with different resource requirements and capabilities. Specifically, we employ attentive embedding and factored-action scheduling that together efficiently incorporate the time-varying interdependency of jobs and machines in RL processing; they enable an end-to-end scalable policy for scheduling diverse jobs on heterogeneous machines. To the best of our knowledge, we are the first to employ an attention mechanism in RL-based cluster resource management. Through experiments, we demonstrate that our approach is competitive with existing heuristic methods under various cluster simulation configurations, e.g., an average 9.2% improvement in slowdown over the shortest job first algorithm. Additionally, the approach yields stable performance on our test cluster when running synthetic workloads based on real traces.
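
The abstract compares against the shortest job first (SJF) heuristic using slowdown as the metric. As a rough illustration of what that baseline measures, the sketch below runs non-preemptive SJF on a single machine and reports the mean slowdown. It is not SCARL and not the authors' simulator: the `Job` class, the `sjf_average_slowdown` function, the single-machine single-resource setting, and the definition of slowdown as (completion time minus arrival time) divided by job duration are all assumptions made for this example, and the abstract does not state which definition the paper uses.

```python
from dataclasses import dataclass


@dataclass
class Job:
    # Hypothetical job record for illustration only.
    arrival: float   # time at which the job enters the queue
    duration: float  # processing time once started (must be > 0)


def sjf_average_slowdown(jobs):
    """Run non-preemptive SJF on one machine and return the mean slowdown.

    Assumed definition: slowdown = (completion - arrival) / duration.
    """
    if not jobs:
        raise ValueError("at least one job is required")

    pending = sorted(jobs, key=lambda j: j.arrival)
    clock = 0.0
    slowdowns = []

    while pending:
        # Jobs that have already arrived and can be started now.
        ready = [j for j in pending if j.arrival <= clock]
        if not ready:
            # Machine is idle: jump ahead to the next arrival.
            clock = pending[0].arrival
            continue

        # SJF rule: among ready jobs, pick the one with the shortest duration.
        job = min(ready, key=lambda j: j.duration)
        pending.remove(job)

        clock += job.duration  # job runs to completion without preemption
        slowdowns.append((clock - job.arrival) / job.duration)

    return sum(slowdowns) / len(slowdowns)


if __name__ == "__main__":
    workload = [Job(0, 4), Job(0, 1), Job(0, 2)]
    print(sjf_average_slowdown(workload))
```

Under this assumed definition a slowdown of 1.0 means a job never waited, so a lower average is better. A learned scheduler such as the one the abstract describes would replace the `min(...)` selection rule with a policy that also chooses among heterogeneous machines and multiple resource types.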

Cheong, M., Lee, H., Yeom, I., & Woo, H. (2019). SCARL: Attentive reinforcement learning-based scheduling in a multi-resource heterogeneous cluster. IEEE Access, 7, 153432–153444. https://doi.org/10.1109/ACCESS.2019.2948150
