Previous work has used monolingual parallel corpora to extract and generate paraphrases. We show that this task can be done using bilingual parallel corpora, a much more commonly available resource. Using alignment techniques from phrasebased statistical machine translation, we show how paraphrases in one language can be identified using a phrase in another language as a pivot. We define a paraphrase probability that allows paraphrases extracted from a bilingual parallel corpus to be ranked using translation probabilities, and show how it can be refined to take contextual information into account. We evaluate our paraphrase extraction and ranking methods using a set of manual word alignments, and contrast the quality with paraphrases extracted from automatic alignments. © 2005 Association for Computational Linguistics.
CITATION STYLE
Bannard, C., & Callison-Burch, C. (2005). Paraphrasing with bilingual parallel corpora. In ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (pp. 597–604). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1219840.1219914
Mendeley helps you to discover research relevant for your work.