TrustAL: Trustworthy Active Learning Using Knowledge Distillation

Abstract

Active learning can be defined as iterations of data labeling, model training, and data acquisition, repeated until sufficient labels are acquired. A traditional view of data acquisition is that, through iterations, knowledge from human labels and models is implicitly distilled to monotonically increase accuracy and label consistency. Under this assumption, the most recently trained model is a good surrogate for the current labeled data, from which data acquisition is requested based on uncertainty/diversity. Our contribution is debunking this myth and proposing a new objective for distillation. First, we found example forgetting, which indicates the loss of knowledge learned across iterations. Second, for this reason, the last model is no longer the best teacher. To mitigate such forgotten knowledge, we select one of its predecessor models as a teacher, using our proposed notion of “consistency”. We show that this novel distillation is distinctive in the following three aspects. First, consistency helps avoid forgetting labels. Second, consistency improves both the uncertainty and the diversity of labeled data. Lastly, consistency redeems defective labels produced by human annotators.
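The abstract does not give formal definitions of example forgetting or of "consistency", so the following is only a minimal Python sketch under stated assumptions. It assumes an example counts as forgotten when an earlier model predicted it correctly and the newest model does not, and it assumes the teacher is the predecessor that correctly predicts the most such forgotten examples. The function names `forgotten_indices` and `select_teacher` are hypothetical and are not taken from the paper.

```python
from typing import List, Optional, Sequence


def forgotten_indices(
    prev_preds: Sequence[int],
    curr_preds: Sequence[int],
    labels: Sequence[int],
) -> List[int]:
    """Indices the earlier model got right but the later model gets wrong.

    Assumption: this is one simple reading of "example forgetting".
    """
    return [
        i
        for i, (p, c, y) in enumerate(zip(prev_preds, curr_preds, labels))
        if p == y and c != y
    ]


def select_teacher(
    history: Sequence[Sequence[int]],
    labels: Sequence[int],
) -> Optional[int]:
    """Pick a predecessor model to act as the distillation teacher.

    `history` holds one list of predictions per active learning iteration,
    with the newest model last. The score used here (number of examples the
    newest model forgot that the predecessor still predicts correctly) is an
    assumed stand-in for the paper's notion of consistency.
    Returns the index of the chosen predecessor, or None if there is none.
    """
    latest = history[-1]
    best_idx: Optional[int] = None
    best_score = -1
    for idx, preds in enumerate(history[:-1]):
        score = len(forgotten_indices(preds, latest, labels))
        # ">=" lets the more recent predecessor win ties.
        if score >= best_score:
            best_idx, best_score = idx, score
    return best_idx


if __name__ == "__main__":
    labels = [0, 1, 1, 0, 1]
    history = [
        [0, 1, 0, 0, 0],  # model from iteration 0
        [0, 0, 1, 1, 1],  # model from iteration 1
        [0, 0, 1, 1, 0],  # newest model
    ]
    print(select_teacher(history, labels))
```

In this toy run the newest model is not the best source of knowledge about the examples it has forgotten, which is the situation the abstract describes; the chosen predecessor would then supply soft targets for distillation when training the next model.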

Cite

Kwak, B. W., Kim, Y., Kim, Y. J., Hwang, S. W., & Yeo, J. (2022). TrustAL: Trustworthy Active Learning Using Knowledge Distillation. In Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022 (Vol. 36, pp. 7263–7271). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v36i7.20688
