An unsupervised framework of exploring events on Twitter: Filtering, extraction and categorization

65Citations
Citations of this article
104Readers
Mendeley users who have this article in their library.

Abstract

Twitter, as a popular microblogging service, has become a new information channel for users to receive and exchange the most up-to-date information on current events. However, since there is no control on how users can publish messages on Twitter, finding newsworthy events from Twitter becomes a difficult task like "finding a needle in a haystack". In this paper we propose a general unsupervised framework to explore events from tweets, which consists of a pipeline process of filtering, extraction and categorization. To filter out noisy tweets, the filtering step exploits a lexicon-based approach to separate tweets that are event-related from those that are not. Then, based on these event-related tweets, the structured representations of events are extracted and categorized automatically using an unsupervised Bayesian model without the use of any labelled data. Moreover, the categorized events are assigned with the event type labels without human intervention. The proposed framework has been evaluated on over 60 millions tweets which were collected for one month in December 2010. A precision of 70.49% is achieved in event extraction, outperforming a competitive baseline by nearly 6%. Events are also clustered into coherence groups with the automatically assigned event type label.

Cite

CITATION STYLE

APA

Zhou, D., Chen, L., & He, Y. (2015). An unsupervised framework of exploring events on Twitter: Filtering, extraction and categorization. In Proceedings of the National Conference on Artificial Intelligence (Vol. 3, pp. 2468–2474). AI Access Foundation. https://doi.org/10.1609/aaai.v29i1.9526

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free