Semantic feature aggregation for gender identification in Russian facebook

7Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The goal of the current work is to evaluate semantic feature aggregation techniques in a task of gender classification of public social media texts in Russian. We collect Facebook posts of Russian-speaking users and apply them as a dataset for two topic modelling techniques and a distributional clustering approach. The output of the algorithms is applied as a feature aggregation method in a task of gender classification based on a smaller Facebook sample. The classification performance of the best model is favorably compared against the lemmas baseline and the state-of-the-art results reported for a different genre or language. The resulting successful features are exemplified, and the difference between the three techniques in terms of classification performance and feature contents are discussed, with the best technique clearly outperforming the others.

Cite

CITATION STYLE

APA

Panicheva, P., Mirzagitova, A., & Ledovaya, Y. (2018). Semantic feature aggregation for gender identification in Russian facebook. In Communications in Computer and Information Science (Vol. 789, pp. 3–15). Springer Verlag. https://doi.org/10.1007/978-3-319-71746-3_1

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free