Towards more personalized Web: Extraction and integration of dynamic content from the web

Marek Kowalkiewicz; Maria E. Orlowska; Tomasz Kaczmarek; Witold Abramowicz

Conference Proceedings

Towards more personalized Web: Extraction and integration of dynamic content from the web

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 3841 LNCS 668-679

DOI: 10.1007/11610113_58

4Citations

7Readers

Get full text

Abstract

Information and content integration are believed to be a possible solution to the problem of information overload in the Internet. The article is an overview of a simple solution for integration of information and content on the Web. Previous approaches to content extraction and integration are discussed, followed by introduction of a novel technology to deal with the problems, based on XML processing. The article includes lessons learned from solving issues of changing webpage layout, incompatibility with HTML standards and multiplicity of the results returned. The method adopting relative XPath queries over DOM tree proves to be more robust than previous approaches to Web information integration. Furthermore, the prototype implementation demonstrates the simplicity that enables non-professional users to easily adopt this approach in their day-to-day information management routines. © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Kowalkiewicz, M., Orlowska, M. E., Kaczmarek, T., & Abramowicz, W. (2006). Towards more personalized Web: Extraction and integration of dynamic content from the web. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3841 LNCS, pp. 668–679). https://doi.org/10.1007/11610113_58

Towards more personalized Web: Extraction and integration of dynamic content from the web

Abstract

Cite

Register to see more suggestions