Our participation in Bilingual Document Alignment shared task at WMT16 focuses on building a language-independent, scalable system for aligning documents based on content as opposed to using webpage meta information. The resulting system is capable of producing scored n-best lists of candidate pages and can therefore be adapted to tasks where either precision or recall is maximized. We conduct a series of experiments that show the effectiveness of the system without any specific tuning.
CITATION STYLE
Shchukin, V., Khristich, D., & Galinskaya, I. (2016). Word Clustering Approach to Bilingual Document Alignment (WMT 2016 Shared Task). In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 2, pp. 740–744). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w16-2376
Mendeley helps you to discover research relevant for your work.