Abstract
Creating high-quality annotated data for task-oriented dialog (TOD) is known to be notoriously difficult, and the challenges are amplified when the goal is to create equitable, culturally adapted, and large-scale TOD datasets for multiple languages. Therefore, the current datasets are still very scarce and suffer from limitations such as translation-based non-native dialogs with translation artefacts, small scale, or lack of cultural adaptation, among others. In this work, we first take stock of the current landscape of multilingual TOD datasets, offering a systematic overview of their properties and limitations. Aiming to reduce all the detected limitations, we then introduce MULTI3WOZ, a novel multilingual, multi-domain, multi-parallel TOD dataset. It is large-scale and offers culturally adapted dialogs in 4 languages to enable training and evaluation of multilingual and cross-lingual TOD systems. We describe a complex bottom–up data collection process that yielded the final dataset, and offer the first sets of baseline scores across different TOD-related tasks for future reference, also highlighting its challenging nature.
Cite
CITATION STYLE
Hu, S., Zhou, H., Hergul, M., Gritta, M., Zhang, G., Iacobacci, I., … Korhonen, A. (2023). MULTI3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems. Transactions of the Association for Computational Linguistics, 11, 1396–1415. https://doi.org/10.1162/tacl_a_00609
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.