An unsupervised framework of exploring events on Twitter: Filtering, extraction and categorization

Deyu Zhou; Liangyu Chen; Yulan He

Conference ProceedingsOPEN ACCESS

An unsupervised framework of exploring events on Twitter: Filtering, extraction and categorization

Proceedings of the National Conference on Artificial Intelligence (2015) 3 2468-2474

DOI: 10.1609/aaai.v29i1.9526

65Citations

101Readers

Abstract

Twitter, as a popular microblogging service, has become a new information channel for users to receive and exchange the most up-to-date information on current events. However, since there is no control on how users can publish messages on Twitter, finding newsworthy events from Twitter becomes a difficult task like "finding a needle in a haystack". In this paper we propose a general unsupervised framework to explore events from tweets, which consists of a pipeline process of filtering, extraction and categorization. To filter out noisy tweets, the filtering step exploits a lexicon-based approach to separate tweets that are event-related from those that are not. Then, based on these event-related tweets, the structured representations of events are extracted and categorized automatically using an unsupervised Bayesian model without the use of any labelled data. Moreover, the categorized events are assigned with the event type labels without human intervention. The proposed framework has been evaluated on over 60 millions tweets which were collected for one month in December 2010. A precision of 70.49% is achieved in event extraction, outperforming a competitive baseline by nearly 6%. Events are also clustered into coherence groups with the automatically assigned event type label.

Cite

CITATION STYLE

APA

Zhou, D., Chen, L., & He, Y. (2015). An unsupervised framework of exploring events on Twitter: Filtering, extraction and categorization. In Proceedings of the National Conference on Artificial Intelligence (Vol. 3, pp. 2468–2474). AI Access Foundation. https://doi.org/10.1609/aaai.v29i1.9526

An unsupervised framework of exploring events on Twitter: Filtering, extraction and categorization

Abstract

Cite

Register to see more suggestions