Abstract
Word sense disambiguation (WSD), which aims to determine an appropriate sense for a target word given its context, is crucial for natural language understanding. Existing supervised methods treat WSD as a classification task and have achieved remarkable performance. However, they ignore uncertainty estimation (UE) in the real-world setting, where the data is always noisy and out of distribution. This paper extensively studies UE on the benchmark designed for WSD. Specifically, we first compare four uncertainty scores for a state-of-the-art WSD model and verify that the conventional predictive probabilities obtained at the final layer of the model are inadequate to quantify uncertainty. Then, we examine the capability of capturing data and model uncertainties by the model with the selected UE score on well-designed test scenarios and discover that the model adequately reflects data uncertainty but underestimates model uncertainty. Furthermore, we explore numerous lexical properties that intrinsically affect data uncertainty and provide a detailed analysis of four critical aspects: the syntactic category, morphology, sense granularity, and semantic relations. The code is available at https://github.com/RyanLiut/WSD-UE.
Cite
CITATION STYLE
Liu, Z., & Liu, Y. (2023). Ambiguity Meets Uncertainty: Investigating Uncertainty Estimation for Word Sense Disambiguation. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 3963–3977). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-acl.245
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.