Improving Low-Resource Cross-lingual Parsing with Expected Statistic Regularization

3Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.

Abstract

We present Expected Statistic Regulariza tion (ESR), a novel regularization technique that utilizes low-order multi-task structural statistics to shape model distributions for semi-supervised learning on low-resource datasets. We study ESR in the context of cross-lingual transfer for syntactic analysis (POS tagging and labeled dependency parsing) and present several classes of low-order statistic functions that bear on model behavior. Experimentally, we evaluate the proposed statistics with ESR for unsupervised transfer on 5 diverse target languages and show that all statistics, when estimated accurately, yield improvements to both POS and LAS, with the best statistic improving POS by +7.0 and LAS by +8.5 on average. We also present semi-supervised transfer and learning curve experiments that show ESR provides significant gains over strong cross-lingual-transfer-plus-fine-tuning baselines for modest amounts of label data. These results indicate that ESR is a promising and complementary approach to model-transfer approaches for cross-lingual parsing.1.

Cite

CITATION STYLE

APA

Effland, T., & Collins, M. (2023). Improving Low-Resource Cross-lingual Parsing with Expected Statistic Regularization. Transactions of the Association for Computational Linguistics, 11, 122–138. https://doi.org/10.1162/tacl_a_00537

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free