Paraphrasing with bilingual parallel corpora

Colin Bannard; Chris Callison-Burch

Conference ProceedingsOPEN ACCESS

Paraphrasing with bilingual parallel corpora

ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (2005) 597-604

DOI: 10.3115/1219840.1219914

418Citations

267Readers

Abstract

Previous work has used monolingual parallel corpora to extract and generate paraphrases. We show that this task can be done using bilingual parallel corpora, a much more commonly available resource. Using alignment techniques from phrasebased statistical machine translation, we show how paraphrases in one language can be identified using a phrase in another language as a pivot. We define a paraphrase probability that allows paraphrases extracted from a bilingual parallel corpus to be ranked using translation probabilities, and show how it can be refined to take contextual information into account. We evaluate our paraphrase extraction and ranking methods using a set of manual word alignments, and contrast the quality with paraphrases extracted from automatic alignments. © 2005 Association for Computational Linguistics.

Cite

CITATION STYLE

APA

Bannard, C., & Callison-Burch, C. (2005). Paraphrasing with bilingual parallel corpora. In ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (pp. 597–604). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1219840.1219914

Paraphrasing with bilingual parallel corpora

Abstract

Cite

Register to see more suggestions