Autonomic Workload Performance Modeling for Large-Scale Databases and Data Warehouses through Deep Belief Network with Data Augmentation Using Conditional Generative Adversarial Networks

2Citations
Citations of this article
37Readers
Mendeley users who have this article in their library.

Abstract

Databases and warehouses are experiencing workload of different types such as Decision Support System (DSS), Online Transaction Processing (OLTP) and Mixed workloads. Handling variety of workload in autonomic systems is a critical task. After self-configuring the workload, the next challenge is workload performance tuning that motives towards self-predictive systems. Existing studies provide performance modeling solutions on small-scale data repositories of Database Management System (DBMS) and Data Warehouse (DWH) using either classical eager or lazy learning approaches. However, in real-world problems, we normally have to deal with large-scale data repositories. Therefore, there is a need to develop performance models that provide data augmentation to solve large-scale data repositories that are not publicly available. In this study, deep learning approaches have been investigated for performance tuning of large-scale data repositories. We propose a performance prediction model called Optimized GAN-based Deep Learning (OGDL) model. For data augmentation, Conditional Generative Adversarial Networks (CGAN) is applied. For autonomic perspective, we incorporated MAPE-K model to manage the workload autonomically. Different deep learning models are applied, and it was observed that Deep Belief Network (DBN) performed better as compared to other deep learning models such as Deep Neural Network (DNN). We performed a number of experiments and from results it is observed that deep learning models performed the best in comparison with classical machine learning and lazy learning and a 6-8\% increase in accuracy is recorded in our experiments using DBN. The proposed OGDL model performed the best in workload performance predictions in an optimized way.

Cite

CITATION STYLE

APA

Shaheen, N., Raza, B., Shahid, A. R., & Malik, A. K. (2021). Autonomic Workload Performance Modeling for Large-Scale Databases and Data Warehouses through Deep Belief Network with Data Augmentation Using Conditional Generative Adversarial Networks. IEEE Access, 9, 97603–97620. https://doi.org/10.1109/ACCESS.2021.3096039

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free