A faceted crawler for the Twitter service

George Valkanas; Antonia Saravanou; Dimitrios Gunopulos

Journal Article

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 8787, pp. 178-188 (2014)

DOI: 10.1007/978-3-319-11746-1_13

Citations: 6

Readers: 44

Abstract

Researchers now have access to valuable data from social networking applications, of which Twitter and Facebook are the most prominent examples. To retrieve this content, the Twitter service provides two distinct Application Programming Interfaces (APIs), a probe-based one and a streaming one, each of which imposes different limitations on the data collection process. In this paper, we present a general architecture that facilitates faceted crawling of the service and simplifies retrieval. We give implementation details of our system and provide a simple way to express the crawling process, i.e., the crawl flow. We experimentally evaluate the system on a variety of faceted crawls, demonstrating its efficacy for the online medium.

Citation (APA)

Valkanas, G., Saravanou, A., & Gunopulos, D. (2014). A faceted crawler for the Twitter service. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 8787, 178–188. https://doi.org/10.1007/978-3-319-11746-1_13