The parameters of any machine learning (ML) model are obtained from the dataset on which the model is trained. However, existing research reveals that many datasets appear to have strong build-in biases. These biases are inherently learned by the learning mechanism of the ML model which adversely affects their generalization performance. In this research, we propose a new supervised data augmentation mechanism which we call as Data Augmentation Pursuit (DAP). The DAP generates labelled synthetic data instances for augmenting the raw datasets. To demonstrate the effectiveness of utilizing DAP for reducing model bias, we perform comprehensive experiments on real world image dataset. CNN models trained on augmented dataset obtained using DAP achieves significantly better classification performance and exhibits reduction in the bias learned by their learning mechanism.
CITATION STYLE
Verma, S., Wang, C., Zhu, L., & Liu, W. (2019). Towards Effective Data Augmentations via Unbiased GAN Utilization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11672 LNAI, pp. 555–567). Springer Verlag. https://doi.org/10.1007/978-3-030-29894-4_45
Mendeley helps you to discover research relevant for your work.