Bilingually learning word senses for translation

Abstract

All words in every natural language are ambiguous, especially when translation is at stake. Translation tasks therefore require finding adequate translations for such words in the contexts where they occur. This article describes a bilingual strategy for clustering words according to their meanings, using a publicly available, sentence-aligned parallel corpus. Word senses are discriminated by their translations and by the words occurring in a window, in both the source-language and target-language parallel sentences. The strategy is language independent and uses a correlation algorithm to filter out irrelevant features. The clusters obtained were evaluated in terms of F-measure (averaging 94%), and their homogeneity and completeness were determined using V-measure (averaging 83%). The learned clusters are then used to train a support vector machine to tag ambiguous words with their translations in the contexts where they occur. This task was also evaluated in terms of F-measure and compared against a baseline. © 2014 Springer-Verlag Berlin Heidelberg.
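
The abstract reports homogeneity and completeness through V-measure without spelling out the computation. As a minimal sketch, assuming the standard entropy-based definition with equal weighting of the two components (the abstract does not say which variant or implementation the authors used), the score could be computed as follows. The function names and the toy labels are hypothetical and are not taken from the article.

```python
from collections import Counter
from math import log


def entropy(labels):
    """Shannon entropy H(labels), using natural logarithms."""
    n = len(labels)
    return -sum((c / n) * log(c / n) for c in Counter(labels).values())


def conditional_entropy(targets, given):
    """Conditional entropy H(targets | given) over paired label sequences."""
    n = len(targets)
    joint = Counter(zip(given, targets))
    given_counts = Counter(given)
    return -sum(
        (c / n) * log(c / given_counts[g]) for (g, _), c in joint.items()
    )


def v_measure(gold, predicted):
    """Return (homogeneity, completeness, V) for two labelings.

    Assumes equal weighting (beta = 1), so V is the harmonic mean of
    homogeneity and completeness. This is an illustrative sketch, not
    the authors' evaluation code.
    """
    h_gold = entropy(gold)
    h_pred = entropy(predicted)
    # Homogeneity: each cluster should contain a single gold sense.
    homogeneity = (
        1.0 if h_gold == 0 else 1.0 - conditional_entropy(gold, predicted) / h_gold
    )
    # Completeness: each gold sense should fall into a single cluster.
    completeness = (
        1.0 if h_pred == 0 else 1.0 - conditional_entropy(predicted, gold) / h_pred
    )
    if homogeneity + completeness == 0:
        return homogeneity, completeness, 0.0
    v = 2 * homogeneity * completeness / (homogeneity + completeness)
    return homogeneity, completeness, v


# Hypothetical toy data: gold senses of one ambiguous word vs. cluster ids.
gold_senses = ["bank_river", "bank_river", "bank_money", "bank_money"]
clusters = [1, 1, 0, 0]
print(v_measure(gold_senses, clusters))
```

Cluster identifiers only need to be consistent, not to match the gold sense names, so a clustering that separates the two senses perfectly scores 1 on all three values regardless of how the clusters are numbered. Placing every occurrence in a single cluster is trivially complete but not homogeneous, which drives V to 0.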

Cite

Casteleiro, J., Lopes, G. P., & Silva, J. (2014). Bilingually learning word senses for translation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8404 LNCS, pp. 283–295). Springer Verlag. https://doi.org/10.1007/978-3-642-54903-8_24
