MBCrawler: A software architecture for micro-blog crawler


Abstract

Obtaining data is a prerequisite for research on micro-blogging services. Because micro-blog Web pages use Web 2.0 techniques such as AJAX, their contents are generated dynamically and rapidly, which makes them hard for a traditional Web page crawler to crawl. Micro-blogging services, however, provide APIs through which well-structured data can be obtained easily. MBCrawler, a software architecture for micro-blogging service crawlers, is designed on top of these APIs. The architecture is modular and scalable, so it can be adapted to the specific features of different micro-blogging services. SinaMBCrawler, a crawler application for Sina Weibo built on MBCrawler, has been developed. It automatically invokes the Sina Weibo APIs to crawl data and saves the crawled data into a local database. © 2013 Springer-Verlag.
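
The abstract gives no implementation details, so the following is only a minimal sketch of how a modular, API-based crawler of this kind might be organized. It is not the authors' design. Every name here (`MicroBlogAPI`, `FakeAPI`, `Crawler`, `get_posts`, `get_followees`, the `posts` table) is a hypothetical chosen for illustration, and an in-memory stand-in replaces any real service API. The idea shown is that the crawler depends only on a small service interface, so supporting another micro-blogging service would mean writing one more adapter class, while crawled records go into a local SQLite database.

```python
import sqlite3
from abc import ABC, abstractmethod
from collections import deque


class MicroBlogAPI(ABC):
    """Hypothetical service adapter: one subclass per micro-blogging service."""

    @abstractmethod
    def get_posts(self, user_id: str) -> list:
        """Return the user's posts as dicts with 'id', 'user_id', 'text'."""

    @abstractmethod
    def get_followees(self, user_id: str) -> list:
        """Return the IDs of the users that this user follows."""


class FakeAPI(MicroBlogAPI):
    """In-memory stand-in (an assumption) so the sketch runs offline."""

    def __init__(self, posts, follows):
        self._posts = posts
        self._follows = follows

    def get_posts(self, user_id):
        return [{"id": pid, "user_id": user_id, "text": text}
                for pid, text in self._posts.get(user_id, [])]

    def get_followees(self, user_id):
        return self._follows.get(user_id, [])


class Crawler:
    """Breadth-first crawl over the follow graph, storing posts locally."""

    def __init__(self, api: MicroBlogAPI, conn: sqlite3.Connection):
        self.api = api
        self.conn = conn
        conn.execute(
            "CREATE TABLE IF NOT EXISTS posts "
            "(id TEXT PRIMARY KEY, user_id TEXT, text TEXT)"
        )

    def crawl(self, seed: str, max_users: int = 100) -> int:
        queue = deque([seed])
        seen = {seed}
        visited = 0
        while queue and visited < max_users:
            user = queue.popleft()
            visited += 1
            for post in self.api.get_posts(user):
                # INSERT OR IGNORE keeps repeated crawls from duplicating rows.
                self.conn.execute(
                    "INSERT OR IGNORE INTO posts VALUES (?, ?, ?)",
                    (post["id"], post["user_id"], post["text"]),
                )
            for followee in self.api.get_followees(user):
                if followee not in seen:
                    seen.add(followee)
                    queue.append(followee)
        self.conn.commit()
        return visited


def demo(max_users: int = 100):
    """Crawl a tiny made-up graph; return (users visited, posts stored)."""
    api = FakeAPI(
        posts={"alice": [("p1", "hello"), ("p2", "world")],
               "bob": [("p3", "hi")]},
        follows={"alice": ["bob"], "bob": ["alice", "carol"]},
    )
    conn = sqlite3.connect(":memory:")
    visited = Crawler(api, conn).crawl("alice", max_users=max_users)
    stored = conn.execute("SELECT COUNT(*) FROM posts").fetchone()[0]
    return visited, stored
```

In a real deployment the `FakeAPI` adapter would presumably be replaced by one that calls the service's actual API and handles authentication and rate limits, neither of which is modeled in this sketch.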

Cite

Lu, G., Liu, S., & Lü, K. (2013). MBCrawler: A software architecture for micro-blog crawler. In Lecture Notes in Electrical Engineering (Vol. 212 LNEE, pp. 119–127). https://doi.org/10.1007/978-3-642-34531-9_13
