Obfuscating Gender in Social Media Writing

66Citations
Citations of this article
118Readers
Mendeley users who have this article in their library.

Abstract

The vast availability of textual data on social media has led to an interest in algorithms to predict user attributes such as gender based on the user's writing. These methods are valuable for social science research as well as targeted advertising and profiling, but also compromise the privacy of users who may not realize that their personal idiolects can give away their demographic identities. Can we automatically modify a text so that the author is classified as a certain target gender, under limited knowledge of the classifier, while preserving the text's fluency and meaning? We present a basic model to modify a text using lexical substitution, show empirical results with Twitter and Yelp data, and outline ideas for extensions.

Cite

CITATION STYLE

APA

Reddy, S., & Knight, K. (2016). Obfuscating Gender in Social Media Writing. In NLP + CSS 2016 - EMNLP 2016 Workshop on Natural Language Processing and Computational Social Science, Proceedings of the Workshop (pp. 17–26). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w16-5603

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free