BIT’s System for Multilingual Track

1Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This paper describes the system we submitted to the IWSLT 2023 multilingual speech translation track, with the input is speech from one language, and the output is text from 10 target languages. Our system consists of CNN and Transformer, convolutional neural networks downsample speech features and extract local information, while transformer extract global features and output the final results. In our system, we use speech recognition tasks to pre-train encoder parameters, and then use speech translation corpus to train the multilingual speech translation model. We have also adopted other methods to optimize the model, such as data augmentation, model ensemble, etc. Our system can obtain satisfactory results on test sets of 10 languages in the MUST-C corpus.

Cite

CITATION STYLE

APA

Wang, Z., Guo, Y., & Chen, S. (2023). BIT’s System for Multilingual Track. In 20th International Conference on Spoken Language Translation, IWSLT 2023 - Proceedings of the Conference (pp. 455–460). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.iwslt-1.44

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free