Abstract
The rapid advancement of data generation techniques has spurred innovation across multiple domains. This comprehensive review delves into the realm of data generation methodologies, with a keen focus on statistical and machine learning-based approaches. Notably, novel strategies like the divide-and-conquer (DC) approach and cutting-edge models such as GANBLR have emerged to tackle a spectrum of challenges, spanning from preserving intricate data relationships to enhancing interpretability. Furthermore, the integration of generative adversarial networks (GANs) has sparked a revolution in data generation across sectors like healthcare, cybersecurity, and retail. This review meticulously examines how these techniques mitigate issues such as class imbalance, data scarcity, and privacy concerns. Through a meticulous analysis of evaluation metrics and diverse applications, it underscores the efficacy and potential of synthetic data in refining predictive models and decision-making software. Concluding with insights into prospective research trajectories and the evolving role of synthetic data in propelling machine learning and data-driven solutions across disciplines, this work provides a holistic understanding of the transformative power of contemporary data generation methodologies.
Author supplied keywords
Cite
CITATION STYLE
Papadaki, E., Vrahatis, A. G., & Kotsiantis, S. (2024, May 1). Exploring Innovative Approaches to Synthetic Tabular Data Generation. Electronics (Switzerland). Multidisciplinary Digital Publishing Institute (MDPI). https://doi.org/10.3390/electronics13101965
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.