Solving Multi-class Imbalance Problems Using Improved Tabular GANs

Abstract

Multi-class imbalance problems are a non-standard class of data science problems. They arise from skewness in the underlying data distribution, which in turn raises numerous issues for conventional machine learning techniques. To address the lack of data in imbalance problems, we can either collect new data or oversample the underrepresented classes by synthesizing artificial data from the original instances. This paper focuses on the latter and introduces two novel tabular GAN variants for handling multi-class imbalance problems. Empirical results on three datasets from the UCI repository demonstrate that the suggested approaches, which use our proposed filtering algorithm based on neighboring rules, improve the ability of a decision tree classification model to recognize instances of underrepresented classes, decrease its bias toward the majority class, and enhance its generalization ability.
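The abstract does not spell out the filtering algorithm, so the following is only a minimal sketch of one plausible neighbor-based rule, not the authors' method: a synthetic candidate for a minority class is kept only if the majority of its k nearest real instances carry that same class label. The function names (`knn_label`, `filter_synthetic`), the choice of Euclidean distance, the value of k, and the toy data are all assumptions made for illustration; the candidates here are hand-written points standing in for GAN output.

```python
import math
from collections import Counter


def knn_label(point, real_X, real_y, k=3):
    """Return the majority label among the k real points closest to `point`."""
    # Rank real instances by Euclidean distance to the candidate point.
    order = sorted(range(len(real_X)), key=lambda i: math.dist(point, real_X[i]))
    votes = Counter(real_y[i] for i in order[:k])
    return votes.most_common(1)[0][0]


def filter_synthetic(synthetic, target_label, real_X, real_y, k=3):
    """Keep only synthetic points whose real neighborhood agrees with target_label.

    Hypothetical rule for illustration; the paper's actual filter may differ.
    """
    return [p for p in synthetic if knn_label(p, real_X, real_y, k) == target_label]


# Toy data: three minority points near the origin, four majority points near (5, 5).
real_X = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1),
          (5.0, 5.0), (5.1, 4.9), (4.9, 5.2), (5.2, 5.1)]
real_y = ["minor", "minor", "minor", "major", "major", "major", "major"]

# Stand-ins for generator output intended for the minority class.
candidates = [(0.15, 0.15), (4.8, 5.0), (0.05, 0.1)]

kept = filter_synthetic(candidates, "minor", real_X, real_y, k=3)
print(kept)
```

In this toy setup the candidate that lands inside the majority cluster is discarded, while the two that fall among real minority instances are kept and could then be added to the training set before fitting a classifier.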

Cite

Farou, Z., Kopeikina, L., & Horváth, T. (2022). Solving Multi-class Imbalance Problems Using Improved Tabular GANs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13756 LNCS, pp. 527–539). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-21753-1_51
