Abstract
While user attribute extraction on social media has received considerable attention, existing approaches, mostly supervised, encounter great difficulty in obtaining gold standard data and are therefore limited to predicting unary predicates (e.g., gender). In this paper, we present a weaklysupervised approach to user profile extraction from Twitter. Users' profiles from social media websites such as Facebook or Google Plus are used as a distant source of supervision for extraction of their attributes from user-generated text. In addition to traditional linguistic features used in distant supervision for information extraction, our approach also takes into account network information, a unique opportunity offered by social media. We test our algorithm on three attribute domains: spouse, education and job; experimental results demonstrate our approach is able to make accurate predictions for users' attributes based on their tweets. © 2014 Association for Computational Linguistics.
Cite
CITATION STYLE
Li, J., Ritter, A., & Hovy, E. (2014). Weakly supervised user profile extraction from twitter. In 52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014 - Proceedings of the Conference (Vol. 1, pp. 165–174). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/p14-1016
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.