Paraphrasing with bilingual parallel corpora

418Citations
Citations of this article
267Readers
Mendeley users who have this article in their library.

Abstract

Previous work has used monolingual parallel corpora to extract and generate paraphrases. We show that this task can be done using bilingual parallel corpora, a much more commonly available resource. Using alignment techniques from phrasebased statistical machine translation, we show how paraphrases in one language can be identified using a phrase in another language as a pivot. We define a paraphrase probability that allows paraphrases extracted from a bilingual parallel corpus to be ranked using translation probabilities, and show how it can be refined to take contextual information into account. We evaluate our paraphrase extraction and ranking methods using a set of manual word alignments, and contrast the quality with paraphrases extracted from automatic alignments. © 2005 Association for Computational Linguistics.

Cite

CITATION STYLE

APA

Bannard, C., & Callison-Burch, C. (2005). Paraphrasing with bilingual parallel corpora. In ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (pp. 597–604). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1219840.1219914

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free