Alleviate dataset shift problem in fine-grained entity typing with virtual adversarial training


Abstract

The recent success of Distant Supervision (DS) provides abundant labeled data for fine-grained entity typing (FET) without human annotation. However, the heuristically generated labels inevitably introduce a significant distribution gap, namely dataset shift, between the distantly labeled training set and the manually curated test set. Considerable effort has gone into alleviating this problem from the label perspective, either by denoising the training labels or by designing noise-aware loss functions. Despite this progress, dataset shift can hardly be eliminated completely. In this work, complementary to the label perspective, we reconsider the problem from the model perspective: can we learn a more robust typing model in the presence of dataset shift? To this end, we propose a novel regularization module based on virtual adversarial training (VAT). The proposed approach first uses a self-paced sample selection function to select suitable samples for VAT, then constructs virtual adversarial perturbations from the selected samples, and finally regularizes the model to be robust to such perturbations. Experiments on two benchmarks demonstrate the effectiveness of the proposed method, with average improvements of 3.8%, 2.5%, and 3.2% in accuracy, Macro F1, and Micro F1, respectively, over the next best method.
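
The three steps named in the abstract (select samples, build virtual adversarial perturbations, regularize against them) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a toy linear softmax classifier, whereas FET is usually multi-label. It also assumes a simple loss-threshold rule for self-paced selection, because the abstract does not specify the selection function. All function names and the parameters `eps`, `xi`, and `threshold` are hypothetical.

```python
import math
import random

def softmax(z):
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

def logits(W, x):
    # W is a list of rows (num_types x dim); x is a feature vector.
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def kl(p, q):
    # KL(p || q) for two discrete distributions.
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def normalize(v):
    n = math.sqrt(sum(c * c for c in v))
    return [c / n for c in v] if n > 0 else v

def self_paced_select(losses, threshold):
    # Assumed rule: keep samples whose current loss is at most the threshold.
    return [i for i, loss in enumerate(losses) if loss <= threshold]

def virtual_adversarial_perturbation(W, x, eps=0.5, xi=1e-3, rng=random):
    # One power-iteration step: probe a random direction, then follow the
    # gradient of KL(p || q) with respect to the input perturbation.
    p = softmax(logits(W, x))
    d = normalize([rng.gauss(0, 1) for _ in x])
    q = softmax(logits(W, [a + xi * b for a, b in zip(x, d)]))
    # For a linear softmax model this gradient is W^T (q - p).
    diff = [qi - pi for qi, pi in zip(q, p)]
    g = [sum(W[k][j] * diff[k] for k in range(len(W))) for j in range(len(x))]
    return [eps * c for c in normalize(g)]

def vat_loss(W, x, eps=0.5, xi=1e-3, rng=random):
    # Regularizer: divergence between predictions on x and on x + r_adv.
    p = softmax(logits(W, x))
    r = virtual_adversarial_perturbation(W, x, eps, xi, rng)
    q = softmax(logits(W, [a + b for a, b in zip(x, r)]))
    return kl(p, q)
```

In training, a term like `vat_loss` would be averaged over the selected samples and added to the supervised loss with a weighting coefficient. No label is needed to compute it, which is why the perturbation is called virtual.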

Shi, H., Tang, S., Gu, X., Chen, B., Chen, Z., Shao, J., & Ren, X. (2020). Alleviate dataset shift problem in fine-grained entity typing with virtual adversarial training. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2021-January, pp. 3898–3904). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2020/539
