Abstract
We present the design and creation of a disability-first dataset, "BIV-Priv,"which contains 728 images and 728 videos of 14 private categories captured by 26 blind participants to support downstream development of artificial intelligence (AI) models. While best practices in dataset creation typically attempt to eliminate private content, some applications require such content for model development. We describe our approach in creating this dataset with private content in an ethical way, including using props rather than participants' own private objects and balancing multi-disciplinary perspectives (e.g., accessibility, privacy, computer vision) to meet the tangible metrics (e.g., diversity, category, amount of content) to support AI innovations. We observed challenges that our participants encountered during the data collection, including accessibility issues (e.g., understanding foreground vs. background object placement) and issues due to the sensitive nature of the content (e.g., discomfort in capturing some props such as condoms around family members).
Author supplied keywords
Cite
CITATION STYLE
Sharma, T., Stangl, A., Zhang, L., Tseng, Y. Y., Xu, I., Findlater, L., … Wang, Y. (2023). Disability-First Design and Creation of A Dataset Showing Private Visual Information Collected With People Who Are Blind. In Conference on Human Factors in Computing Systems - Proceedings. Association for Computing Machinery. https://doi.org/10.1145/3544548.3580922
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.