Abstract
Definition modelling is the task of automatically generating a dictionary-style definition given a target word. In this paper, we consider cross-lingual definition generation. Specifically, we generate English definitions for Wolastoqey (Malecite-Passamaquoddy) words. Wolastoqey is an endangered, low-resource polysynthetic language. We hypothesize that sub-word representations based on byte pair encoding (Sennrich et al., 2016) can be leveraged to represent morphologically-complex Wolastoqey words and overcome the challenge of not having large corpora available for training. Our experimental results demonstrate that this approach outperforms baseline methods in terms of BLEU score.
Cite
CITATION STYLE
Bear, D., & Cook, P. (2021). Cross-Lingual Wolastoqey-English Definition Modelling. In International Conference Recent Advances in Natural Language Processing, RANLP (pp. 138–146). Incoma Ltd. https://doi.org/10.26615/978-954-452-072-4_017
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.