Fairness Testing of Machine Translation Systems

Zeyu Sun; Zhenpeng Chen; Jie Zhang; Dan Hao

Journal ArticleOPEN ACCESS

Fairness Testing of Machine Translation Systems

ACM Transactions on Software Engineering and Methodology (2024) 33(6)

DOI: 10.1145/3664608

1Citations

17Readers

Abstract

Machine translation is integral to international communication and extensively employed in diverse human-related applications. Despite remarkable progress, fairness issues persist within current machine translation systems. In this article, we propose FairMT, an automated fairness testing approach tailored for machine translation systems. FairMT operates on the assumption that translations of semantically similar sentences, containing protected attributes from distinct demographic groups, should maintain comparable meanings. It comprises three key steps: (1) test input generation, producing inputs covering various demographic groups; (2) test oracle generation, identifying potential unfair translations based on semantic similarity measurements; and (3) regression, discerning genuine fairness issues from those caused by low-quality translation. Leveraging FairMT, we conduct an empirical study on three leading machine translation systems-Google Translate, T5, and Transformer. Our investigation uncovers up to 832, 1,984, and 2,627 unfair translations across the three systems, respectively. Intriguingly, we observe that fair translations tend to exhibit superior translation performance, challenging the conventional wisdom of a fairness-performance tradeoff prevalent in the fairness literature.

Author supplied keywords

References Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Sun, Z., Chen, Z., Zhang, J., & Hao, D. (2024). Fairness Testing of Machine Translation Systems. ACM Transactions on Software Engineering and Methodology, 33(6). https://doi.org/10.1145/3664608

Readers over time

Readers' Seniority

Lecturer / Post doc 6

86%

PhD / Post grad / Masters / Doc 1

14%

Readers' Discipline

Computer Science 5

100%

Fairness Testing of Machine Translation Systems

Abstract

Author supplied keywords

References Powered by Scopus

A survey of named entity recognition and classification

Algorithmic decision making and the cost of fairness

A machine learning approach to coreference resolution of noun phrases

Register to see more suggestions

Cite

Readers over time

Readers' Seniority

Readers' Discipline