Models for identifying depression using social media text exhibit biases towards different gender and racial/ethnic groups. Factors like representation and balance of groups within the dataset are contributory factors, but difference in content and social media use may further explain these biases. We present an analysis of the content of social media posts from different demographic groups. Our analysis shows that there are content differences between depression and control subgroups across demographic groups, and that temporal topics and demographic-specific topics are correlated with downstream depression model error. We discuss the implications of our work on creating future datasets, as well as designing and training models for mental health.
CITATION STYLE
Aguirre, C., & Dredze, M. (2021). Qualitative Analysis of Depression Models by Demographics. In Computational Linguistics and Clinical Psychology: Improving Access, CLPsych 2021 - Proceedings of the 7th Workshop, in conjunction with NAACL 2021 (pp. 169–180). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.clpsych-1.19
Mendeley helps you to discover research relevant for your work.