Fair human-centric image dataset for ethical AI benchmarking

Alice Xiang; Jerone T.A. Andrews; Rebecca L. Bourke; William Thong; Julienne M. LaChance; Tiffany Georgievski; Apostolos Modas; Aida Rahmattalabi; Yunhao Ba; Shruti Nagpal; Orestis Papakyriakopoulos; Dora Zhao; Jinru Xue; Victoria Matthews; Linxia Gong; Austin T. Hoag; Mircea Cimpoi; Swami Sankaranarayanan; Wiebke Hutiri; Morgan K. Scheuerman; Albert S. Abedi; Peter Stone; Peter R. Wurman; Hiroaki Kitano; Michael Spranger

Journal Article, Open Access

Nature (2025) 648(8092) 97-108

DOI: 10.1038/s41586-025-09716-2

Abstract

Computer vision is central to many artificial intelligence (AI) applications, from autonomous vehicles to consumer devices. However, the data behind such technical innovations are often collected with insufficient consideration of ethical concerns [1–3]. This has led to a reliance on datasets that lack diversity, perpetuate biases and are collected without the consent of data rights holders. These datasets compromise the fairness and accuracy of AI models and disenfranchise stakeholders [4–8]. Although awareness of the problems of bias in computer vision technologies, particularly facial recognition, has become widespread [9], the field lacks publicly available, consensually collected datasets for evaluating bias for most tasks [3,10,11]. In response, we introduce the Fair Human-Centric Image Benchmark (FHIBE, pronounced ‘Feebee’), a publicly available human image dataset implementing best practices for consent, privacy, compensation, safety, diversity and utility. FHIBE can be used responsibly as a fairness evaluation dataset for many human-centric computer vision tasks, including pose estimation, person segmentation, face detection and verification, and visual question answering. By leveraging comprehensive annotations capturing demographic and physical attributes, environmental factors, instrument and pixel-level annotations, FHIBE can identify a wide variety of biases. The annotations also enable more nuanced and granular bias diagnoses, enabling practitioners to better understand sources of bias and mitigate potential downstream harms. FHIBE therefore represents an important step forward towards trustworthy AI, raising the bar for fairness benchmarks and providing a road map for responsible data curation in AI.

Citation (APA)

Xiang, A., Andrews, J. T. A., Bourke, R. L., Thong, W., LaChance, J. M., Georgievski, T., … Spranger, M. (2025). Fair human-centric image dataset for ethical AI benchmarking. Nature, 648(8092), 97–108. https://doi.org/10.1038/s41586-025-09716-2
