Using unstructured profile information for gender classification of Portuguese and English Twitter users

Abstract

This paper reports experiments on automatically detecting the gender of Twitter users based on unstructured information found in their Twitter profiles. A set of previously proposed features is evaluated on two datasets of English and Portuguese users, and their performance is assessed using several supervised and unsupervised approaches, including Naive Bayes variants, Logistic Regression, Support Vector Machines, Fuzzy c-Means clustering, and k-means. Results show that the features perform well in each language separately, but even better results were achieved when combining both languages. Supervised approaches reached 97.9% accuracy, but Fuzzy c-Means also proved suitable for this task, achieving 96.4% accuracy.
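
As a rough illustration of the unsupervised side of this comparison, the sketch below runs a plain fuzzy c-means with two clusters on toy two-dimensional vectors. The feature values, the function name `fuzzy_c_means`, the fuzzifier m = 2, the iteration count, and the initial centers are all assumptions made for illustration; the abstract does not describe the paper's features or implementation at this level of detail.

```python
import math

def fuzzy_c_means(points, centers, m=2.0, iterations=50):
    """Plain fuzzy c-means (illustrative sketch, not the paper's code).

    Returns the final centers and the membership rows computed in the
    last iteration (one row per point, one column per cluster).
    """
    centers = [list(c) for c in centers]
    dims = len(points[0])
    exponent = 2.0 / (m - 1.0)
    memberships = []
    for _ in range(iterations):
        # Membership update: closer centers receive higher membership.
        memberships = []
        for p in points:
            dists = [math.dist(p, c) for c in centers]
            if min(dists) == 0.0:
                # Point sits exactly on a center: split membership among
                # the coinciding centers to avoid division by zero.
                row = [1.0 if d == 0.0 else 0.0 for d in dists]
                total = sum(row)
                row = [r / total for r in row]
            else:
                row = [1.0 / sum((dj / dk) ** exponent for dk in dists)
                       for dj in dists]
            memberships.append(row)
        # Center update: mean of the points weighted by membership ** m.
        for j in range(len(centers)):
            weights = [row[j] ** m for row in memberships]
            total = sum(weights)
            if total > 0.0:
                centers[j] = [
                    sum(w * p[d] for w, p in zip(weights, points)) / total
                    for d in range(dims)
                ]
    return centers, memberships

# Hypothetical toy data: each user is a pair of made-up scores in [0, 1].
points = [(0.9, 0.1), (0.8, 0.2), (0.85, 0.05),
          (0.1, 0.9), (0.2, 0.8), (0.05, 0.95)]
centers, memberships = fuzzy_c_means(points, centers=[(1.0, 0.0), (0.0, 1.0)])

# Harden the fuzzy memberships into one cluster index per user.
labels = [max(range(len(row)), key=row.__getitem__) for row in memberships]
print(labels)
```

Unlike k-means, each user keeps a graded membership in both clusters, so borderline profiles can be flagged rather than forced into one class. Mapping a cluster index to a gender label would still require some external evidence, which this sketch leaves out.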

Cite

Vicente, M., Carvalho, J. P., & Batista, F. (2015). Using unstructured profile information for gender classification of Portuguese and English Twitter users. In Communications in Computer and Information Science (Vol. 563, pp. 57–64). Springer Verlag. https://doi.org/10.1007/978-3-319-27653-3_6
