Cross-Lingual Syntactic Transfer with Limited Resources

Mohammad Sadegh Rasooli; Michael Collins

Journal Article, Open Access

Transactions of the Association for Computational Linguistics (2017) 5 279-293

DOI: 10.1162/tacl_a_00061

Abstract

We describe a simple but effective method for cross-lingual syntactic transfer of dependency parsers, in the scenario where a large amount of translation data is not available. This method makes use of three steps: 1) a method for deriving cross-lingual word clusters, which can then be used in a multilingual parser; 2) a method for transferring lexical information from a target language to source language treebanks; 3) a method for integrating these steps with the density-driven annotation projection method of Rasooli and Collins (2015). Experiments show improvements over the state-of-the-art in several languages used in previous work, in a setting where the only source of translation data is the Bible, a considerably smaller corpus than the Europarl corpus used in previous work. Results using the Europarl corpus as a source of translation data show additional improvements over the results of Rasooli and Collins (2015). We conclude with results on 38 datasets from the Universal Dependencies corpora.
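The annotation projection idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' density-driven method: it only shows the basic step of copying dependency arcs from a parsed source sentence onto a target sentence through one-to-one word alignments. The function names `project_heads` and `density`, the data layout, and the definition of density as the fraction of target tokens that receive a head are all assumptions made for illustration.

```python
from typing import Dict, List, Optional


def project_heads(
    source_heads: List[int],
    alignment: Dict[int, int],
    target_len: int,
) -> List[Optional[int]]:
    """Project dependency heads from a source sentence onto a target sentence.

    source_heads[i] is the head of source token i + 1 (tokens are 1-indexed,
    0 denotes the artificial root). alignment maps a source token index to a
    target token index and is assumed to be one-to-one (hypothetical input).
    Returns one entry per target token: the projected head, or None when no
    arc could be projected for that token.
    """
    target_heads: List[Optional[int]] = [None] * target_len
    for s_dep, t_dep in alignment.items():
        s_head = source_heads[s_dep - 1]
        if s_head == 0:
            # The source root maps to the target root.
            target_heads[t_dep - 1] = 0
        elif s_head in alignment:
            # Both ends of the source arc are aligned, so the arc carries over.
            target_heads[t_dep - 1] = alignment[s_head]
    return target_heads


def density(heads: List[Optional[int]]) -> float:
    """Fraction of target tokens that received a projected head (assumed definition)."""
    return sum(h is not None for h in heads) / len(heads) if heads else 0.0


# Source: 3 tokens with heads [2, 3, 0]; the target has 4 tokens and its
# third token is unaligned, so it receives no head.
projected = project_heads([2, 3, 0], {1: 1, 2: 2, 3: 4}, target_len=4)
print(projected, density(projected))
```

Partial trees like the one above, where some tokens have no head, are the kind of incomplete projection that a selection criterion based on density would rank or filter.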

Cite

APA

Rasooli, M. S., & Collins, M. (2017). Cross-Lingual Syntactic Transfer with Limited Resources. Transactions of the Association for Computational Linguistics, 5, 279–293. https://doi.org/10.1162/tacl_a_00061
