Abstract
In parataxis languages like Chinese, word meanings are constructed using specific wordformations, which can help to disambiguate word senses. However, such knowledge is rarely explored in previous word sense disambiguation (WSD) methods. In this paper, we propose to leverage word-formation knowledge to enhance Chinese WSD. We first construct a large-scale Chinese lexical sample WSD dataset with word-formations. Then, we propose a model FormBERT to explicitly incorporate word-formations into sense disambiguation. To further enhance generalizability, we design a word-formation predictor module in case word-formation annotations are unavailable. Experimental results show that our method brings substantial performance improvement over strong baselines.
Cite
CITATION STYLE
Zheng, H., Li, L., Dai, D., Chen, D., Liu, T., Sun, X., & Liu, Y. (2021). Leveraging Word-Formation Knowledge for Chinese Word Sense Disambiguation. In Findings of the Association for Computational Linguistics, Findings of ACL: EMNLP 2021 (pp. 918–923). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.findings-emnlp.78
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.