Exacerbating Algorithmic Bias through Fairness Attacks

32 Citations · 54 Mendeley Readers

Abstract

Algorithmic fairness has attracted significant attention in recent years, with many quantitative measures suggested for characterizing the fairness of different machine learning algorithms. Despite this interest, the robustness of those fairness measures with respect to an intentional adversarial attack has not been properly addressed. Indeed, most adversarial machine learning research has focused on the impact of malicious attacks on the accuracy of the system, without any regard to the system's fairness. We propose new types of data poisoning attacks where an adversary intentionally targets the fairness of a system. Specifically, we propose two families of attacks that target fairness measures. In the anchoring attack, we skew the decision boundary by placing poisoned points near specific target points to bias the outcome. In the influence attack on fairness, we aim to maximize the covariance between the sensitive attributes and the decision outcome, thereby degrading the fairness of the model. We conduct extensive experiments that demonstrate the effectiveness of our proposed attacks.
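To make the two attack families concrete, the sketch below illustrates the core ideas on toy data. It is not the authors' implementation: the data generator, the `anchoring_poison` helper, and the `decision_boundary_covariance` function are hypothetical names introduced here for illustration, and scikit-learn's LogisticRegression stands in for the victim model. The covariance of the sensitive attribute with the signed distance to the decision boundary is used as a rough proxy for the fairness quantity the influence attack targets.

```python
# Minimal, illustrative sketch of anchoring-style poisoning and a
# covariance-based fairness proxy (assumed helpers, not the paper's code).
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

def make_data(n=500):
    """Toy data: 2-D features, binary label, binary sensitive attribute."""
    s = rng.integers(0, 2, size=n)                  # sensitive attribute
    x = rng.normal(size=(n, 2)) + s[:, None] * 0.5  # mild group shift
    y = (x[:, 0] + 0.5 * x[:, 1] + rng.normal(scale=0.5, size=n) > 0).astype(int)
    return x, y, s

def decision_boundary_covariance(model, x, s):
    """Covariance between the sensitive attribute and the signed distance
    to the decision boundary; the influence attack on fairness tries to
    drive this quantity up (larger magnitude => more group disparity)."""
    scores = model.decision_function(x)
    return np.mean((s - s.mean()) * scores)

def anchoring_poison(x, y, s, n_poison=50, radius=0.1):
    """Anchoring-style poisoning sketch: place poisoned points close to
    sampled target points but with flipped labels, skewing the learned
    boundary around those anchors."""
    idx = rng.choice(len(x), size=n_poison, replace=False)
    x_p = x[idx] + rng.normal(scale=radius, size=(n_poison, x.shape[1]))
    y_p = 1 - y[idx]                                # opposite labels
    s_p = s[idx]
    return (np.vstack([x, x_p]),
            np.concatenate([y, y_p]),
            np.concatenate([s, s_p]))

x, y, s = make_data()
clean = LogisticRegression().fit(x, y)
x_a, y_a, s_a = anchoring_poison(x, y, s)
attacked = LogisticRegression().fit(x_a, y_a)

print("covariance (clean):   ", decision_boundary_covariance(clean, x, s))
print("covariance (attacked):", decision_boundary_covariance(attacked, x, s))
```

In the paper, the poisoned points are chosen adversarially (e.g., anchored to targets from a specific demographic group, or optimized via influence functions) rather than sampled at random as in this toy sketch.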

Cite (APA)

Mehrabi, N., Naveed, M., Morstatter, F., & Galstyan, A. (2021). Exacerbating Algorithmic Bias through Fairness Attacks. In 35th AAAI Conference on Artificial Intelligence, AAAI 2021 (Vol. 10B, pp. 8930–8938). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v35i10.17080
