Directed retrieval and extraction of high-quality product specifications

0Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In recent years, a large quantity of algorithms has been presented for extracting information from semi-structured sources like HTML pages. Some of them already focus on product information and are adopted, e.g., in online platforms. However, most of those algorithms do not specifically target technical product specifications and never take the localization of such specifications into account. This work focuses on automating the whole process of retrieving and extracting product specifications. It achieves a high data quality by directing the source retrieval to producer pages where product specifications are extracted in an unsupervised manner. The resulting specifications are of high relevance to consumers since they enable effective product comparisons. The success of the developed algorithms is proven by a federated information system called Fedseeko. © 2011 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Walther, M., Hähne, L., Schuster, D., & Schill, A. (2011). Directed retrieval and extraction of high-quality product specifications. In Lecture Notes in Business Information Processing (Vol. 73 LNBIP, pp. 436–450). Springer Verlag. https://doi.org/10.1007/978-3-642-19802-1_30

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free