Abstract
This paper addresses the task of user classification in social media, with an application to Twitter. We automatically infer the values of user attributes such as political orientation or ethnicity by leveraging observable information such as the user behavior, network structure and the linguistic content of the user’s Twitter feed. We employ a machine learning approach which relies on a comprehensive set of features derived from such user information. We report encouraging experimental results on 3 tasks with different characteristics: political affiliation detection, ethnicity identification and detecting affinity for a particular business. Finally, our analysis shows that rich linguistic features prove consistently valuable across the 3 tasks and show great promise for additional user classification needs.
Cite
CITATION STYLE
Pennacchiotti, M., & Popescu, A. M. (2011). A Machine Learning Approach to Twitter User Classification. In Proceedings of the 5th International AAAI Conference on Weblogs and Social Media, ICWSM 2011 (pp. 281–288). AAAI Press. https://doi.org/10.1609/icwsm.v5i1.14139
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.