Factors affecting cloud data-center efficiency: A scheduling algorithm-based analysis

Abstract

Cloud computing encompasses two massively scalable services, computing capability and data storage space, which are provided by a massive number of machines and clusters. The increased use of big data has led to the adoption of a wide range of analytics engines, and Hadoop in particular has gained widespread acceptance as a data analytics platform. Over the past decade, Hadoop's ability to schedule tasks has become a critical aspect of system performance. Numerous researchers have proposed scheduling methods to address the complex issue of performance degradation, but few studies to date have evaluated the effectiveness of these methods. Using the PRISMA approach to search for and select papers, we examine the design choices behind the Hadoop scheduling techniques proposed between 2008 and 2021, present a taxonomy that categorises these techniques succinctly, and evaluate the methodologies against a variety of performance metrics. Our search identified 82 studies relevant to this domain, all of which came from high-quality conferences, journals, symposiums, and workshops. This systematic review discusses dynamic, constrained, and adaptive scheduling methods and their primary motivations, including makespan, data control, deadline, resource utilisation, load balancing, fairness, energy efficiency, and failure recovery. It identifies and discusses the most critical factors affecting Hadoop scheduler performance, outlines unresolved issues and potential directions for extending existing studies, and provides a roadmap for researchers working in this field. Finally, we intend to expand on the qualitative analysis conducted thus far and give experts additional recommendations for future cloud scheduling research.
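To make a few of the metrics named above concrete, the following is a minimal sketch and not a method taken from the reviewed studies. It assumes a toy cluster of identical nodes and a simple longest-task-first greedy assignment. The function names (`greedy_schedule`, `makespan`, `utilisation`, `load_balance_index`) are hypothetical, and the load-balance measure is Jain's fairness index applied to per-node load, which is one possible choice among many.

```python
def greedy_schedule(task_durations, num_nodes):
    """Assign tasks, longest first, to whichever node is currently least loaded.

    This is an illustrative heuristic only; it is not a Hadoop scheduler.
    Returns the per-node total load and the per-node task lists.
    """
    loads = [0.0] * num_nodes
    assignment = [[] for _ in range(num_nodes)]
    for duration in sorted(task_durations, reverse=True):
        node = loads.index(min(loads))  # first node with the smallest load
        loads[node] += duration
        assignment[node].append(duration)
    return loads, assignment


def makespan(loads):
    """Completion time of the whole job: the finish time of the busiest node."""
    return max(loads)


def utilisation(loads):
    """Fraction of node-time spent doing work up to the makespan."""
    span = max(loads)
    return sum(loads) / (len(loads) * span) if span > 0 else 0.0


def load_balance_index(loads):
    """Jain's index over node loads: 1.0 means perfectly even load."""
    total = sum(loads)
    squares = sum(x * x for x in loads)
    return (total * total) / (len(loads) * squares) if squares > 0 else 1.0


if __name__ == "__main__":
    # Hypothetical task durations (arbitrary time units) on a two-node cluster.
    loads, assignment = greedy_schedule([5, 1, 1], num_nodes=2)
    print("per-node load:", loads)
    print("makespan:", makespan(loads))
    print("utilisation:", utilisation(loads))
    print("load balance:", load_balance_index(loads))
```

In this toy setting a single long task dominates the makespan, so utilisation and the load-balance index both fall below 1.0 even though every node is assigned work. Tensions of this kind between metrics are what the scheduling approaches surveyed here trade off.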

Citation (APA)

Shehloo, A. A., Butt, M. A., & Zaman, M. (2021). Factors affecting cloud data-center efficiency: A scheduling algorithm-based analysis. International Journal of Advanced Technology and Engineering Exploration. Accent Social and Welfare Society. https://doi.org/10.19101/IJATEE.2021.874313
