Two-level alignment by words and phrases based on syntactic information

Seonho Kim; Juntae Yoon; Dong Yul Ra

Journal Article

Two-level alignment by words and phrases based on syntactic information

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2004) 2945 309-320

DOI: 10.1007/978-3-540-24630-5_38

0Citations

2Readers

Get full text

Abstract

As a part of work on alignment of the English and Korean parallel corpus, this paper presents a statistical translation model incorporating linguistic knowledge of syntactic and phrasal information for better translations. For this, we propose three models: First, we incorporate syntactic information such as part of speech into the word-based lexical alignment. Based on this model, we propose the second model which finds phrasal correspondence in the parallel corpus. Phrasal mapping through chunk-based shallow parsing enables to settle mismatch of meaningful units in the two languages. Lastly, we develop a two-level alignment model by combining these two models in order to construct both the word and phrase-based translation model. Model parameters are automatically estimated from a set of bilingual sentence pairs by applying the EM algorithm. Experiments show that the structural relationship helps construct a better translation model for structurally different languages like Korean and English. © Springer-Verlag 2004.

Cite

CITATION STYLE

APA

Kim, S., Yoon, J., & Ra, D. Y. (2004). Two-level alignment by words and phrases based on syntactic information. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2945, 309–320. https://doi.org/10.1007/978-3-540-24630-5_38

Two-level alignment by words and phrases based on syntactic information

Abstract

Cite

Register to see more suggestions