Improving Formality-Sensitive Machine Translation using Data-Centric Approaches and Prompt Engineering

Abstract

In this paper, we present the KU x Upstage team’s submission for the Special Task on Formality Control on Spoken Language Translation, which involves translating English into four languages with diverse grammatical formality markers. Our methodology comprises two primary components: 1) a language-specific, data-driven approach, and 2) the generation of synthetic data using large-scale language models and empirically grounded prompt engineering. By adapting methodologies and models to the linguistic properties of each language, we observe a notable improvement in performance over the baseline, which supports the effectiveness of data-driven approaches. Moreover, our prompt engineering strategy yields higher-quality synthetic translation instances.

Cite

Lee, S., Moon, H., Park, C., & Lim, H. (2023). Improving Formality-Sensitive Machine Translation using Data-Centric Approaches and Prompt Engineering. In 20th International Conference on Spoken Language Translation, IWSLT 2023 - Proceedings of the Conference (pp. 420–432). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.iwslt-1.40
