A multilingual procedure for dictionary-based sentence alignment

Adam Meyers; Michiko Kosaka; Ralph Grishman

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (1998) 1529 187-198

DOI: 10.1007/3-540-49478-2_18

13 Citations

35 Readers

Abstract

This paper describes a sentence alignment technique based on a machine-readable dictionary. Alignment takes place in a single pass through the text, based on the scores of matches between pairs of source and target sentences. Pairings consisting of sets of matches are evaluated using a version of the Gale-Shapley solution to the stable marriage problem. An algorithm is described which can handle N-to-1 (or 1-to-N) matches, for N ≥ 0, i.e., deletions, 1-to-1 (including scrambling), and 1-to-many matches. A simple frequency-based method for acquiring supplemental dictionary entries is also discussed. We achieve high-quality alignments using available bilingual dictionaries, both for closely related language pairs (Spanish/English) and more distantly related pairs (Japanese/English).
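
To make the abstract's combination of dictionary-based match scores and stable-marriage pairing concrete, here is a minimal sketch of the textbook Gale-Shapley procedure applied to a sentence score matrix. It is not the paper's algorithm: the scoring function `match_score`, the `min_score` cutoff, and the toy dictionary are all assumptions made for illustration, and the sketch covers only 1-to-1 pairings (including crossed, i.e. scrambled, order) and unaligned sentences, not the N-to-1 cases the abstract mentions.

```python
from collections import deque


def match_score(src_tokens, tgt_tokens, dictionary):
    """Hypothetical score: fraction of source tokens that have at least
    one dictionary translation present in the target sentence."""
    if not src_tokens:
        return 0.0
    tgt = set(tgt_tokens)
    hits = sum(1 for w in src_tokens if dictionary.get(w, set()) & tgt)
    return hits / len(src_tokens)


def stable_alignment(scores, min_score=0.0):
    """Textbook Gale-Shapley over a score matrix.

    scores[i][j] is the match score of source sentence i and target
    sentence j. Source sentences propose in order of decreasing score;
    a target keeps the best-scoring proposer seen so far. Pairs scoring
    at or below min_score (an assumed cutoff) are never proposed, so a
    sentence with no acceptable partner stays unaligned.
    Returns a dict mapping source index -> target index.
    """
    n_src = len(scores)
    n_tgt = len(scores[0]) if scores else 0
    # Preference list per source sentence: acceptable targets, best first.
    prefs = [
        sorted(
            (j for j in range(n_tgt) if scores[i][j] > min_score),
            key=lambda j: -scores[i][j],
        )
        for i in range(n_src)
    ]
    next_choice = [0] * n_src   # index of the next target each source will try
    engaged_to = {}             # target index -> source index
    free = deque(range(n_src))  # sources that still need to propose

    while free:
        i = free.popleft()
        if next_choice[i] >= len(prefs[i]):
            continue  # preference list exhausted: source i stays unaligned
        j = prefs[i][next_choice[i]]
        next_choice[i] += 1
        current = engaged_to.get(j)
        if current is None:
            engaged_to[j] = i
        elif scores[i][j] > scores[current][j]:
            engaged_to[j] = i     # target j trades up to source i
            free.append(current)  # the displaced source proposes again later
        else:
            free.append(i)        # rejected: source i tries its next choice
    return {i: j for j, i in engaged_to.items()}


if __name__ == "__main__":
    # Toy Spanish/English data, invented for this example.
    dictionary = {"gato": {"cat"}, "perro": {"dog"},
                  "duerme": {"sleeps"}, "ladra": {"barks"}}
    src = [["el", "gato", "duerme"], ["el", "perro", "ladra"]]
    tgt = [["the", "dog", "barks"], ["the", "cat", "sleeps"]]
    scores = [[match_score(s, t, dictionary) for t in tgt] for s in src]
    print(stable_alignment(scores))
```

In the toy data the target sentences appear in the opposite order from the source sentences, so the sketch pairs source 0 with target 1 and source 1 with target 0. Raising `min_score` leaves weakly matching sentences unaligned, which is one simple way to model deletions.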

Cite

Meyers, A., Kosaka, M., & Grishman, R. (1998). A multilingual procedure for dictionary-based sentence alignment. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1529, pp. 187–198). Springer Verlag. https://doi.org/10.1007/3-540-49478-2_18
