Abstract
Various applications are developed today on top of microblogging services like Twitter. In order to engineer Web applications which operate on microblogging data, there is a need for appropriate filtering techniques to identify messages. In this paper, we focus on detecting Twitter messages (tweets) that report on social events. We introduce a filtering pipeline that exploits textual features and n-grams to classify messages into event related and non-event related tweets. We analyze the impact of preprocessing techniques, achieving accuracies higher than 80%. Further, we present a strategy to automate labeling of training data, since our proposed filtering pipeline requires training data. When testing on our dataset, this semi-automated method achieves an accuracy of 79% and results comparable to the manual labeling approach. © 2012 Springer-Verlag.
Author supplied keywords
Cite
CITATION STYLE
Ilina, E., Hauff, C., Celik, I., Abel, F., & Houben, G. J. (2012). Social event detection on Twitter. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7387 LNCS, pp. 169–176). https://doi.org/10.1007/978-3-642-31753-8_12
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.