Obtaining data is a prerequisite for research on micro-blogging services. Because micro-blog Web pages use Web 2.0 techniques such as AJAX, their content is generated dynamically, which makes them hard for traditional Web page crawlers to crawl. Micro-blogging services provide APIs through which well-structured data can be obtained easily. MBCrawler, a software architecture for micro-blogging service crawlers, is designed on top of these APIs. The architecture is modular and scalable, so it can be adapted to the specific features of different micro-blogging services. SinaMBCrawler, a crawler application for Sina Weibo based on MBCrawler, has been developed. It automatically invokes the Sina Weibo APIs to crawl data and saves the crawled data into a local database. © 2013 Springer-Verlag.
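The general idea of an API-based, modular crawler that writes to a local database can be illustrated with a minimal sketch. This is not the MBCrawler design itself, which the abstract does not detail. The names `MicroBlogAPI`, `FakeAPI`, `crawl`, `get_posts`, and `get_friends` are hypothetical, the breadth-first traversal from a seed user and the SQLite table layout are assumptions, and the in-memory `FakeAPI` stands in for a real service so that the example runs without network access.

```python
import sqlite3
from collections import deque
from typing import Dict, List, Protocol


class MicroBlogAPI(Protocol):
    """Hypothetical service adapter: one implementation per micro-blogging service."""

    def get_posts(self, user_id: str) -> List[Dict[str, str]]: ...

    def get_friends(self, user_id: str) -> List[str]: ...


class FakeAPI:
    """In-memory stand-in for a real service API (assumption, for illustration only)."""

    def __init__(self, posts: Dict[str, List[Dict[str, str]]], friends: Dict[str, List[str]]):
        self._posts = posts
        self._friends = friends

    def get_posts(self, user_id: str) -> List[Dict[str, str]]:
        return self._posts.get(user_id, [])

    def get_friends(self, user_id: str) -> List[str]:
        return self._friends.get(user_id, [])


def crawl(api: MicroBlogAPI, db: sqlite3.Connection, seed: str, max_users: int = 100) -> int:
    """Crawl breadth-first from a seed user, saving posts locally.

    Returns the number of users whose posts were fetched.
    """
    db.execute(
        "CREATE TABLE IF NOT EXISTS posts (id TEXT PRIMARY KEY, user_id TEXT, text TEXT)"
    )
    queue = deque([seed])
    seen = {seed}
    crawled = 0
    while queue and crawled < max_users:
        user = queue.popleft()
        for post in api.get_posts(user):
            # INSERT OR IGNORE keeps repeated crawls from duplicating posts.
            db.execute(
                "INSERT OR IGNORE INTO posts VALUES (?, ?, ?)",
                (post["id"], user, post["text"]),
            )
        for friend in api.get_friends(user):
            if friend not in seen:
                seen.add(friend)
                queue.append(friend)
        crawled += 1
    db.commit()
    return crawled
```

Because the crawl loop depends only on the adapter interface, supporting another service in this sketch would mean writing a new adapter class rather than changing `crawl`, which is one way to read the modularity the abstract describes.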
Lu, G., Liu, S., & Lü, K. (2013). MBCrawler: A software architecture for micro-blog crawler. In Lecture Notes in Electrical Engineering (Vol. 212 LNEE, pp. 119–127). https://doi.org/10.1007/978-3-642-34531-9_13