Abstract
This paper examins approaches for translation between English and morphology-rich languages. Experiment with English-Russian and English-Lithuanian revels that "pure" statistical approaches on 10 million word corpus gives unsatisfactory translation. Then, several Web-available linguistic resources are suggested for translation. Syntax parsers, bilingual and semantic dictionaries, bilingual parallel corpus and monolingualWeb-based corpus are integrated in one comprehensive statistical model. Multi-abstraction language representation is used for statistical induction of syntactic and semantic transformation rules called multi-alignment templates. The decodingmodel is described using the feature functions, a log-linear modeling approach and A* search algorithm. An evaluation of this approach is performed on the English-Lithuanian language pair. Presented experimental results demonstrates that the multi-abstraction approach and hybridization of learning methods can improve quality of translation. © 2008 Institute of Mathematics and Informatics.
Author supplied keywords
Cite
CITATION STYLE
Laukaitis, A., & Vasilecas, O. (2008). Multi-alignment templates induction. Informatica, 19(4), 535–554. https://doi.org/10.15388/informatica.2008.229
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.