Solving Multi-class Imbalance Problems Using Improved Tabular GANs

Zakarya Farou; Liudmila Kopeikina; Tomáš Horváth

Conference Proceedings

Solving Multi-class Imbalance Problems Using Improved Tabular GANs

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2022) 13756 LNCS 527-539

DOI: 10.1007/978-3-031-21753-1_51

1Citations

3Readers

Get full text

Abstract

Multi-class imbalance problems are non-standard derivative data science problems. These problems are associated with the skewness in the data underlying distribution, which, in turn, raises numerous issues for conventional machine learning techniques. To address the lack of data in imbalance problems, we can either collect new data or oversample the underrepresented classes by synthesizing artificial data from original instances. This paper focuses on the latter and introduces two novel tabular GAN variants to handle multi-class imbalance problems. Empirical results on three datasets from the UCI repository demonstrated that the suggested approaches that use our proposed filtering algorithm based on neighboring rules improved the ability of the decision tree classification model to recognize underrepresented class instances, decreased the bias toward the majority class, and enhanced its generalization ability.

Author supplied keywords

Cite

CITATION STYLE

APA

Farou, Z., Kopeikina, L., & Horváth, T. (2022). Solving Multi-class Imbalance Problems Using Improved Tabular GANs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13756 LNCS, pp. 527–539). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-21753-1_51

Solving Multi-class Imbalance Problems Using Improved Tabular GANs

Abstract

Author supplied keywords

Cite

Register to see more suggestions