Twitter is a ubiquitous source of micro-blog social media data, providing the academic, industrial, and public sectors real-time access to actionable information. A particularly attractive property of some tweets is geo-tagging, where a user account has opted-in to attaching their current location to each message. Unfortunately (from a researcher's perspective) only a fraction of Twitter accounts agree to this, and these accounts are likely to have systematic diffences with the general population. This work is an exploratory study of these differences across the full range of Twitter content, and complements previous studies that focus on the English-language subset. Additionally, we compare methods for querying users by self-identified properties, finding that the constrained semantics of the “description” field provides cleaner, higher-volume results than more complex regular expressions.
CITATION STYLE
Lippincott, T., & Carrell, A. (2018). Observational comparison of geo-tagged and randomly-drawn tweets. In Proceedings of the 2nd Workshop on Computational Modeling of PFople’s Opinions, PersonaLity, and Emotions in Social Media, PEOPLES 2018 at the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HTL 2018 (pp. 50–55). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w18-1107
Mendeley helps you to discover research relevant for your work.