Abstract
We compare different strategies to apply statistical machine translation techniques in order to retrieve documents that are a plausible translation of a given source document. Finding the translated version of a document is a relevant task; for example, when building a corpus of parallel texts that can help to create and evaluate new machine translation systems. In contrast to the traditional settings in cross-language information retrieval tasks, in this case both the source and the target text are long and, thus, the procedure used to select which words or phrases will be included in the query has a key effect on the retrieval performance. In the statistical approach explored here, both the probability of the translation and the relevance of the terms are taken into account in order to build an effective query. Copyright © 2011 Taylor & Francis Group, LLC.
Cite
CITATION STYLE
Sánchez-Martínez, F., & Carrasco, R. C. (2011). Document translation retrieval based on statistical machine translation techniques. Applied Artificial Intelligence, 25(5), 329–340. https://doi.org/10.1080/08839514.2011.559906
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.