When a hot social issue or event occurs, the number of comments and retweets on Twitter rises significantly on that day. An event can generally be extracted from its term frequency, but an event with a low term frequency is hard to find, so important information may be missed. However, there are reliable users who are directly related to such an event, no matter how few tweets it generates. In this paper, we propose an event extraction method based on user reliability. The latent Dirichlet allocation (LDA) model is adapted with timeline analysis to extract high-frequency events. User behaviors are analyzed to identify reliable users who are directly related to the issue, and reliable low-frequency events are then detected through these users. To verify the effectiveness of the proposed method, four social issues were selected and experiments were run on a Korean Twitter test set. The experimental results showed a precision of 97.2% for the top 10 extracted events (P@10) on each day. This result shows that the proposed method is effective for extracting events from a Twitter corpus. © 2014 Springer-Verlag Berlin Heidelberg.
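The P@10 figure reported above is the fraction of the ten highest-ranked extracted events that are judged correct. The sketch below shows one way such a score could be computed per day and then averaged. The function names, the event labels, and the choice to average over days are assumptions made for illustration, not the authors' evaluation code.

```python
def precision_at_k(ranked_events, relevant_events, k=10):
    """Return the fraction of the top-k ranked events that are relevant.

    ranked_events: list of event labels, best first (hypothetical input).
    relevant_events: set of labels judged correct by an annotator.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    top_k = ranked_events[:k]
    hits = sum(1 for event in top_k if event in relevant_events)
    # Standard P@k divides by k even if fewer than k events were returned.
    return hits / k


def mean_precision_at_k(daily_rankings, relevant_events, k=10):
    """Average P@k over days (assumed aggregation; the abstract does not specify)."""
    if not daily_rankings:
        raise ValueError("daily_rankings must not be empty")
    scores = [
        precision_at_k(ranking, relevant_events, k)
        for ranking in daily_rankings.values()
    ]
    return sum(scores) / len(scores)
```

For example, with `k=2`, one day whose ranking is `["a", "b"]`, another whose ranking is `["c", "d"]`, and the relevant set `{"a", "c", "d"}`, the daily scores are 0.5 and 1.0.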
CITATION STYLE
Tsolmon, B., & Lee, K. S. (2014). Extracting social events based on timeline and user reliability analysis on twitter. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8404 LNCS, pp. 213–223). Springer Verlag. https://doi.org/10.1007/978-3-642-54903-8_18