We present two methods for estimating replacement probabilities without using parallel corpora. The first method proposed exploits the possible translation probabilities latent in Machine Readable Dictionaries (MRD). The second method is more robust, and exploits context similarity-based techniques in order to estimate word translation probabilities using the Internet as a bilingual comparable corpus. The experiments show a statistically significant improvement over non weighted structured queries in terms of MAP by using the replacement probabilities obtained with the proposed methods. The context similarity-based method is the one that yields the most significant improvement. © 2010 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Saralegi, X., & De Lacalle, M. L. (2010). Estimating translation probabilities from the web for structured queries on CLIR. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5993 LNCS, pp. 586–589). Springer Verlag. https://doi.org/10.1007/978-3-642-12275-0_53
Mendeley helps you to discover research relevant for your work.