Similarity based web data extraction and integration system for web content mining

Abstract

The Internet is a major source of the information we need, but information on the web cannot readily be analyzed and queried according to user requests. We propose and develop a similarity-based web data extraction and integration system (WDES and WDICS) that extracts search result pages from the web and integrates their contents so that the user can perform the intended analysis. The system replicates search result pages locally in a form convenient for offline browsing. It works in two phases, and we develop and implement algorithms for extracting and integrating web content. An experiment on Bluetooth product listings gives better precision and recall than DEPTA [1]. © 2012 ICST Institute for Computer Science, Social Informatics and Telecommunications Engineering.
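
The abstract compares systems by precision and recall without saying how they are computed. A minimal sketch of the standard definitions follows, assuming extracted records and ground-truth records can be compared as sets. The function name `precision_recall` and the record labels are hypothetical illustrations, not the paper's evaluation code or data.

```python
def precision_recall(extracted, relevant):
    """Return (precision, recall) for extracted records against a ground-truth set."""
    extracted, relevant = set(extracted), set(relevant)
    true_positives = len(extracted & relevant)
    # Precision: fraction of extracted records that are correct.
    precision = true_positives / len(extracted) if extracted else 0.0
    # Recall: fraction of ground-truth records that were extracted.
    recall = true_positives / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical record labels for illustration only (not data from the paper).
extracted = {"headset-a", "headset-b", "speaker-c", "ad-banner"}
relevant = {"headset-a", "headset-b", "speaker-c", "dongle-d", "dongle-e"}
p, r = precision_recall(extracted, relevant)
print(f"precision={p:.2f} recall={r:.2f}")
```

In this toy case three of the four extracted records are correct and three of the five true records are found, so a system can score well on one measure and poorly on the other.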

Citation (APA)

Srikantaiah, K. C., Suraj, M., Venugopal, K. R., Iyengar, S. S., & Patnaik, L. M. (2012). Similarity based web data extraction and integration system for web content mining. In Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering (Vol. 108 LNICST, pp. 269–274). https://doi.org/10.1007/978-3-642-35615-5_41
