Learning age and gender using co-occurrence of non-dictionary words from stylistic variations

R. Rajendra Prasath

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2010) 6086 LNAI 544-550

DOI: 10.1007/978-3-642-13529-3_58

5 Citations

3 Readers

Abstract

This work reports stylistic differences in blogging across gender and age groups, using slang word co-occurrences. We focus mainly on the co-occurrence of non-dictionary words across bloggers of different genders and age groups. For this analysis, we use slang word usage as the feature for studying stylistic variation among bloggers across age groups and gender. We model the co-occurrences of slang words used by bloggers as a graph-based model in which nodes are slang words and edges represent the number of co-occurrences, and we study how these variations help predict age group and gender. We use the demographically tagged blog corpus from the ICWSM Spinner dataset for these experiments, with a Naive Bayes classifier and 10-fold cross-validation. Preliminary results show that the co-occurrence of slang words could be a better choice for predicting age and gender. © 2010 Springer-Verlag Berlin Heidelberg.
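
The graph model described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the author's implementation: the tiny `DICTIONARY` set, whitespace tokenization, the choice of a whole post as the co-occurrence window, and the function name `slang_cooccurrence_graph` are all assumptions made for the example.

```python
from collections import Counter
from itertools import combinations

# Assumption: a toy stand-in for a real dictionary word list.
DICTIONARY = {"i", "am", "so", "happy", "today", "that", "was", "see", "you"}


def slang_cooccurrence_graph(posts, dictionary=DICTIONARY):
    """Build a weighted co-occurrence graph of non-dictionary words.

    Nodes are non-dictionary (slang) words. The weight of an edge is the
    number of posts in which both words appear (assumed window: one post).
    The graph is returned as a Counter keyed by sorted word pairs.
    """
    edges = Counter()
    for post in posts:
        tokens = post.lower().split()  # assumption: whitespace tokenization
        slang = sorted({t for t in tokens if t not in dictionary})
        for a, b in combinations(slang, 2):
            edges[(a, b)] += 1
    return edges


# Hypothetical posts, invented for illustration only.
posts = [
    "omg lol that was kewl",
    "lol omg see you",
    "i am so happy today",
]

edges = slang_cooccurrence_graph(posts)
for (a, b), weight in sorted(edges.items()):
    print(a, b, weight)
```

Edge weights built this way, computed per blogger, could then serve as features for a classifier such as Naive Bayes; how the paper actually derives features from the graph is not stated in the abstract.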

Cite

APA

Prasath, R. R. (2010). Learning age and gender using co-occurrence of non-dictionary words from stylistic variations. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6086 LNAI, pp. 544–550). https://doi.org/10.1007/978-3-642-13529-3_58
