Cross-Lingual Document Retrieval with Smooth Learning

Abstract

Cross-lingual document search is an information retrieval task in which the language of the queries differs from the language of the documents. In this paper, we study the instability of neural document search models and propose a novel, robust end-to-end framework that improves cross-lingual search performance across different document languages. The framework includes a novel measure of relevance between queries and documents, smooth cosine similarity, and a novel loss function, Smooth Ordinal Search Loss, as the objective. We further provide a theoretical guarantee on the generalization error bound of the proposed framework. We conduct experiments comparing our approach with other document search models and observe significant gains under commonly used ranking metrics on the cross-lingual document retrieval task in a variety of languages.

Cite

Liu, J., Zhang, X., Goldwasser, D., & Wang, X. (2020). Cross-Lingual Document Retrieval with Smooth Learning. In COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference (pp. 3616–3629). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.coling-main.323
