Obfuscating Gender in Social Media Writing

Sravana Reddy; Kevin Knight

Conference ProceedingsOPEN ACCESS

Obfuscating Gender in Social Media Writing

NLP + CSS 2016 - EMNLP 2016 Workshop on Natural Language Processing and Computational Social Science, Proceedings of the Workshop (2016) 17-26

DOI: 10.18653/v1/w16-5603

66Citations

118Readers

Abstract

The vast availability of textual data on social media has led to an interest in algorithms to predict user attributes such as gender based on the user's writing. These methods are valuable for social science research as well as targeted advertising and profiling, but also compromise the privacy of users who may not realize that their personal idiolects can give away their demographic identities. Can we automatically modify a text so that the author is classified as a certain target gender, under limited knowledge of the classifier, while preserving the text's fluency and meaning? We present a basic model to modify a text using lexical substitution, show empirical results with Twitter and Yelp data, and outline ideas for extensions.

Cite

CITATION STYLE

APA

Reddy, S., & Knight, K. (2016). Obfuscating Gender in Social Media Writing. In NLP + CSS 2016 - EMNLP 2016 Workshop on Natural Language Processing and Computational Social Science, Proceedings of the Workshop (pp. 17–26). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w16-5603

Obfuscating Gender in Social Media Writing

Abstract

Cite

Register to see more suggestions