We describe two methods to improve SMT accuracy using shallow syntax information. First, we use chunks to refine the set of word alignments typically used as a starting point in SMT systems. Second, we extend an N -gram-based SMT system with chunk tags to better account for long-distance reorderings. Experiments are reported on an Arabic-English task showing significant improvements. A human error analysis indicates that long-distance reorderings are captured effectively.
CITATION STYLE
Crego, J. M., & Habash, N. (2008). Using shallow syntax information to improve word alignment and reordering for SMT. In 3rd Workshop on Statistical Machine Translation, WMT 2008 at the Annual Meeting of the Association for Computational Linguistics, ACL 2008 (pp. 53–61). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1626394.1626401
Mendeley helps you to discover research relevant for your work.