An Evaluation of ChatGPT's Translation Accuracy Using BLEU Score

Mozhgan Ghassemiazghandi

Journal ArticleOPEN ACCESS

An Evaluation of ChatGPT's Translation Accuracy Using BLEU Score

Ghassemiazghandi M

Theory and Practice in Language Studies (2024) 14(4) 985-994

DOI: 10.17507/tpls.1404.07

27Citations

90Readers

Get full text

Abstract

Traditional views have long held that machine translation cannot achieve the quality and accuracy of human translators, especially in complex language pairs like Persian and English. This study challenges this perspective by demonstrating that ChatGPT-4, with access to vast amounts of multilingual data and leveraging advanced large language model algorithms, significantly outperforms widely utilized open-source machine translation tools and approaches the realm of human translation quality. This research aims to critically assess the translation accuracy of ChatGPT-4 against a traditional open-source machine translation tool from Persian to English, highlighting the advancements in artificial intelligence-driven translation technologies. Using Bilingual Evaluation Understudy scores for a comprehensive evaluation, this study compares the translation outputs from ChatGPT-4 with MateCat, providing a quantitative basis for comparing their accuracy and quality. ChatGPT-4 achieves a BLUE score of 0.88 and an accuracy of 0.68, demonstrating superior performance compared to MateCat, with a 0.82 BLUE score and 0.49 accuracy. The results indicate that the translations generated by ChatGPT-4 surpass those produced by MateCat and nearly mirror the quality of human translations. The evaluation demonstrates the effectiveness of OpenAI's large language model algorithms in improving translation accuracy.

Author supplied keywords

Cite

CITATION STYLE

APA

Ghassemiazghandi, M. (2024). An Evaluation of ChatGPT’s Translation Accuracy Using BLEU Score. Theory and Practice in Language Studies, 14(4), 985–994. https://doi.org/10.17507/tpls.1404.07

An Evaluation of ChatGPT's Translation Accuracy Using BLEU Score

Abstract

Author supplied keywords

Cite

Register to see more suggestions