When one applies machine learning to a real-world problem, sometimes data imbalance makes a crucial impact on the resulting model’s performance. We propose to use generative adversarial network (GAN) to do data balancing through data augmentation in data preprocessing step of binary classification task. We train CycleGAN on unpaired images to be able to produce images from the opposite class for any given input image. After training we use it to produce images from the opposite class for every image in a given imbalanced dataset, thus making it fully-balanced. The proposed augmentation technique may be used as the preprocessing step in binary classification tasks. We show that it improves performance in pneumonia presence/absence classification task on X-ray images. We inspect how binary classifier performance changes if the dataset used for GAN training differs from the dataset we measure binary classifier performance on. We also inspect its behavior on several other binary classification tasks related to medical imaging.
CITATION STYLE
Malygina, T., Ericheva, E., & Drokin, I. (2019). Data augmentation with GAN: Improving Chest X-Ray pathologies prediction on class-imbalanced cases. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11832 LNCS, pp. 321–334). Springer. https://doi.org/10.1007/978-3-030-37334-4_29
Mendeley helps you to discover research relevant for your work.