Abstract
Influence functions (IFs) are a powerful tool for detecting anomalous examples in large scale datasets. However, they are unstable when applied to deep networks. In this paper, we provide an explanation for the instability of IFs and develop a solution to this problem. We show that IFs are unreliable when the two data points belong to two different classes. Our solution leverages class information to improve the stability of IFs. Extensive experiments show that our modification significantly improves the performance and stability of IFs while incurring no additional computational cost.
Cite
CITATION STYLE
Duc-Thang, N., Thanh-Tung, H., Tran, Q., Dang, H. T., Ngoc-Hieu, N., Dau, A., & Bui, N. (2023). Class based Influence Functions for Error Detection. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 2, pp. 1204–1218). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.acl-short.104
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.