Resource distribution estimation for Data-Intensive workloads: Give me my share & no one gets hurt!

Alireza Khoshkbarforoushha; Rajiv Ranjan; Peter Strazdins

Conference Proceedings

Communications in Computer and Information Science (2016) 567 228-237

DOI: 10.1007/978-3-319-33313-7_17

4 Citations

7 Readers

Abstract

Robust resource share estimation for data-intensive workloads is integral to efficient workload management in a (virtualized) cluster where multiple systems coexist and share the same infrastructure. However, developing a reliable resource estimator is challenging because of (i) the heterogeneity of workloads (e.g., stream processing, batch processing, transactional) in a multi-system shared cluster, (ii) limited (in batch processing) or complete (in stream processing) uncertainty about input data sizes or arrival rates, and (iii) configurations that change from run to run. To address these challenges, we propose an inclusive framework and related techniques for workload profiling, similar-job identification, and resource distribution prediction in a cluster. Our analysis shows that the framework can successfully estimate the whole spectrum of resource usage as probability distribution functions for a wide range of data-intensive workloads.
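To make the idea of predicting a resource distribution rather than a single value concrete, here is a minimal sketch. It is not the authors' technique: it assumes a deliberately simple similarity rule (same workload type, input size within a relative tolerance) and an empirical CDF over the matched runs. The record fields (`workload`, `input_gb`, `cpu_cores`), the helper names, and the sample data are all hypothetical.

```python
from bisect import bisect_right


def similar_jobs(history, query, tolerance=0.25):
    """Select past runs of the same workload type whose input size is
    within `tolerance` (relative) of the query's input size.
    This similarity rule is an assumption made for illustration."""
    return [
        run for run in history
        if run["workload"] == query["workload"]
        and abs(run["input_gb"] - query["input_gb"]) <= tolerance * query["input_gb"]
    ]


def empirical_cdf(samples):
    """Return F(x) = fraction of samples less than or equal to x."""
    ordered = sorted(samples)
    n = len(ordered)
    if n == 0:
        raise ValueError("need at least one sample")

    def cdf(x):
        return bisect_right(ordered, x) / n

    return cdf


# Hypothetical profiling records from earlier runs.
history = [
    {"workload": "batch", "input_gb": 10, "cpu_cores": 3.0},
    {"workload": "batch", "input_gb": 11, "cpu_cores": 3.5},
    {"workload": "batch", "input_gb": 9, "cpu_cores": 2.5},
    {"workload": "batch", "input_gb": 12, "cpu_cores": 4.0},
    {"workload": "batch", "input_gb": 50, "cpu_cores": 12.0},
    {"workload": "stream", "input_gb": 10, "cpu_cores": 6.0},
]

query = {"workload": "batch", "input_gb": 10}
matches = similar_jobs(history, query)
cpu_cdf = empirical_cdf([run["cpu_cores"] for run in matches])

# Estimated probability that the new job needs at most 3 cores.
print(cpu_cdf(3.0))
```

A scheduler could read such a distribution at whatever confidence level it needs (for example, reserving the number of cores at which the CDF first reaches a chosen threshold) instead of relying on one point estimate.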

Cite (APA)

Khoshkbarforoushha, A., Ranjan, R., & Strazdins, P. (2016). Resource distribution estimation for Data-Intensive workloads: Give me my share & no one gets hurt! In Communications in Computer and Information Science (Vol. 567, pp. 228–237). Springer Verlag. https://doi.org/10.1007/978-3-319-33313-7_17
