Exploiting parallel texts for word sense disambiguation: An empirical study

Hwee Tou Ng; Bin Wang; Yee Seng Chan

Conference ProceedingsOPEN ACCESS

Exploiting parallel texts for word sense disambiguation: An empirical study

Proceedings of the Annual Meeting of the Association for Computational Linguistics (2003) 2003-July

ISSN: 0736587X

109Citations

121Readers

Abstract

A central problem of word sense disambiguation (WSD) is the lack of manually sense-tagged data required for supervised learning. In this paper, we evaluate an approach to automatically acquire sense-tagged training data from English-Chinese parallel corpora, which are then used for disambiguating the nouns in the SENSEVAL-2 English lexical sample task. Our investigation reveals that this method of acquiring sense-tagged data is promising. On a subset of the most difficult SENSEVAL-2 nouns, the accuracy difference between the two approaches is only 14.0%, and the difference could narrow further to 6.5% if we disregard the advantage that manually sense-tagged data have in their sense coverage. Our analysis also highlights the importance of the issue of domain dependence in evaluating WSD programs.

Cite

CITATION STYLE

APA

Ng, H. T., Wang, B., & Chan, Y. S. (2003). Exploiting parallel texts for word sense disambiguation: An empirical study. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 2003-July). Association for Computational Linguistics (ACL).

Exploiting parallel texts for word sense disambiguation: An empirical study

Abstract

Cite

Register to see more suggestions