Directed retrieval and extraction of high-quality product specifications

Maximilian Walther; Ludwig Hähne; Daniel Schuster; Alexander Schill

Conference Proceedings

Directed retrieval and extraction of high-quality product specifications

Lecture Notes in Business Information Processing (2011) 73 LNBIP 436-450

DOI: 10.1007/978-3-642-19802-1_30

0Citations

2Readers

Get full text

Abstract

In recent years, a large quantity of algorithms has been presented for extracting information from semi-structured sources like HTML pages. Some of them already focus on product information and are adopted, e.g., in online platforms. However, most of those algorithms do not specifically target technical product specifications and never take the localization of such specifications into account. This work focuses on automating the whole process of retrieving and extracting product specifications. It achieves a high data quality by directing the source retrieval to producer pages where product specifications are extracted in an unsupervised manner. The resulting specifications are of high relevance to consumers since they enable effective product comparisons. The success of the developed algorithms is proven by a federated information system called Fedseeko. © 2011 Springer-Verlag Berlin Heidelberg.

Author supplied keywords

Cite

CITATION STYLE

APA

Walther, M., Hähne, L., Schuster, D., & Schill, A. (2011). Directed retrieval and extraction of high-quality product specifications. In Lecture Notes in Business Information Processing (Vol. 73 LNBIP, pp. 436–450). Springer Verlag. https://doi.org/10.1007/978-3-642-19802-1_30

Directed retrieval and extraction of high-quality product specifications

Abstract

Author supplied keywords

Cite

Register to see more suggestions