Real-time statistical speech translation

Krzysztof Wołk; Krzysztof Marasek

Conference Proceedings

Real-time statistical speech translation

Advances in Intelligent Systems and Computing (2014) 275 AISC(VOLUME 1) 107-113

DOI: 10.1007/978-3-319-05951-8_11

10Citations

5Readers

Get full text

Abstract

This research investigates the Statistical Machine Translation approaches to translate speech in real time automatically. Such systems can be used in a pipeline with speech recognition and synthesis software in order to produce a real-time voice communication system between foreigners. We obtained three main data sets from spoken proceedings that represent three different types of human speech. TED, Europarl, and OPUS parallel text corpora were used as the basis for training of language models, for developmental tuning and testing of the translation system. We also conducted experiments involving part of speech tagging, compound splitting, linear language model interpolation, TrueCasing and morphosyntactic analysis. We evaluated the effects of variety of data preparations on the translation results using the BLEU, NIST, METEOR and TER metrics and tried to give answer which metric is most suitable for PL-EN language pair. © Springer International Publishing Switzerland 2014.

Author supplied keywords

Cite

CITATION STYLE

APA

Wołk, K., & Marasek, K. (2014). Real-time statistical speech translation. In Advances in Intelligent Systems and Computing (Vol. 275 AISC, pp. 107–113). Springer Verlag. https://doi.org/10.1007/978-3-319-05951-8_11

Real-time statistical speech translation

Abstract

Author supplied keywords

Cite

Register to see more suggestions