Learning age and gender using co-occurrence of non-dictionary words from stylistic variations

5Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This work attempts to report the stylistic differences in blogging for gender and age group variations using slang word co-occurrences. We have mainly focused on co-occurrence of non dictionary words across bloggers of different gender and age groups. For this analysis, we have focused on the feature use of slang words to study the stylistic variations of bloggers across various age groups and gender. We have modeled the co-occurrences of slang words used by bloggers as graph based model where nodes are slang words and edges represent the number of cooccurrences and studied the variations in predicting age groups and gender. We have used demographically tagged blog corpus from ICWSM Spinner dataset for these experiments and used Naive Bayes classifier with 10 fold cross validations. Preliminary results shows that the concurrence of of slang words could be a better choice for predicting age and gender. © 2010 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Prasath, R. R. (2010). Learning age and gender using co-occurrence of non-dictionary words from stylistic variations. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6086 LNAI, pp. 544–550). https://doi.org/10.1007/978-3-642-13529-3_58

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free