Retrieving relevant portions from structured digital documents

3Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Retrieving relevant portions from structured documents consisting of logical components has been a challenging task in both the database and the information retrieval world, since an answer related to a query may be split across multiple components. In this paper, we propose a query mechanism that applies database style query evaluation in response to IR style keyword-based queries for retrieving relevant answers from a logically structured document. We first define an appropriate semantics of keywords-based queries and then propose an algebra that is capable of computing every relevant portion of a document, which can be considered answer to a set of arbitrary keywords. The ordering and structural relationship among the components are preserved in the answer. We also introduce several practically useful filters that saves users from having to deal with an overwhelming number of answers. © Springer-Verlag Berlin Heidelberg 2004.

Cite

CITATION STYLE

APA

Pradhan, S., & Tanaka, K. (2004). Retrieving relevant portions from structured digital documents. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3180, 328–338. https://doi.org/10.1007/978-3-540-30075-5_32

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free