Developing a sentence level fairness metric using word embeddings

Ahmed Izzidien; Stephen Fitz; Peter Romero; Bao S. Loe; David Stillwell

Journal ArticleOPEN ACCESS

Developing a sentence level fairness metric using word embeddings

Izzidien A
Fitz S
Romero P
et al.

International Journal of Digital Humanities (2022) 5(2-3) 95-130

DOI: 10.1007/s42803-022-00049-4

N/ACitations

6Readers

Abstract

Fairness is a principal social value that is observable in civilisations around the world. Yet, a fairness metric for digital texts that describe even a simple social interaction, e.g., ‘The boy hurt the girl’ has not been developed. We address this by employing word embeddings that use factors found in a new social psychology literature review on the topic. We use these factors to build fairness vectors. These vectors are used as sentence level measures, whereby each dimension reflects a fairness component. The approach is employed to approximate human perceptions of fairness. The method leverages a pro-social bias within word embeddings, for which we obtain an F1 = 79.8 on a list of sentences using the Universal Sentence Encoder (USE). A second approach, using principal component analysis (PCA) and machine learning (ML), produces an F1 = 86.2. Repeating these tests using Sentence Bidirectional Encoder Representations from Transformers (SBERT) produces an F1 = 96.9 and F1 = 100 respectively. Improvements using subspace representations are further suggested. By proposing a first-principles approach, the paper contributes to the analysis of digital texts along an ethical dimension.

Cite

CITATION STYLE

APA

Izzidien, A., Fitz, S., Romero, P., Loe, B. S., & Stillwell, D. (2022). Developing a sentence level fairness metric using word embeddings. International Journal of Digital Humanities, 5(2–3), 95–130. https://doi.org/10.1007/s42803-022-00049-4

Developing a sentence level fairness metric using word embeddings

Abstract

Cite

Register to see more suggestions