Toolkits for generating wrappers: A survey of software toolkits for automated data extraction from web sites

31Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Various web applications in e-business, such as online price comparisons, competition monitoring and personalised newsletters require retrieval of distributed information from the Internet. This paper examines the suitability of software toolkits for the extraction of data from web sites. The term wrapper is defined and an overview of presently available toolkits for generating wrappers is provided. In order to give a better insight into the workings of such toolkits, a detailed analysis of the non-commercial software program LAPIS is presented. An example application using this toolkit demonstrates how acceptable results can be achieved with relative ease. The functionality of the program is compared with the functionality of the commercial toolkit RoboMaker and the differences are highlighted. With the aim of providing improved ease-of-use and faster wrapper generation in mind, possible areas for further development of toolkits for automated web data extraction are discussed. © Springer-Verlag Berlin Heidelberg 2003.

Cite

CITATION STYLE

APA

Kuhlins, S., & Tredwell, R. (2003). Toolkits for generating wrappers: A survey of software toolkits for automated data extraction from web sites. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2591, 184–198. https://doi.org/10.1007/3-540-36557-5_15

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free