Semi-Supervised Learning with Data Augmentation for Tabular Data

14Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Data augmentation-based semi-supervised learning (SSL) methods have made great progress in computer vision and natural language processing areas. One of the most important factors is that the semantic structure invariance of these data allows the augmentation procedure (e.g., rotating images or masking words) to thoroughly utilize the enormous amount of unlabeled data. However, the tabular data does not possess an obvious invariant structure, and therefore similar data augmentation methods do not apply to it. To fill this gap, we present a simple yet efficient data augmentation method particular designed for tabular data and apply it to the SSL algorithm: SDAT (Semi-supervised learning with Data Augmentation for Tabular data). We adopt a multi-task learning framework that consists of two components: the data augmentation procedure and the consistency training procedure. The data augmentation procedure which perturbs in latent space employs a variational auto-encoder (VAE) to generate the reconstructed samples as augmented samples. The consistency training procedure constrains the predictions to be invariant between the augmented samples and the corresponding original samples. By sharing a representation network (encoder), we jointly train the two components to improve effectiveness and efficiency. Extensive experimental studies validate the effectiveness of the proposed method on the tabular datasets.

Cite

CITATION STYLE

APA

Fang, J., Tang, C., Cui, Q., Zhu, F., Li, L., Zhou, J., & Zhu, W. (2022). Semi-Supervised Learning with Data Augmentation for Tabular Data. In International Conference on Information and Knowledge Management, Proceedings (pp. 3928–3932). Association for Computing Machinery. https://doi.org/10.1145/3511808.3557699

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free