What was the query? Generating queries for document sets with applications in cluster labeling

Matthias Hagen; Maximilian Michel; Benno Stein

Conference Proceedings

What was the query? Generating queries for document sets with applications in cluster labeling

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9103 124-133

DOI: 10.1007/978-3-319-19581-0_10

7Citations

4Readers

Get full text

Abstract

We deal with the task of generating a query that retrieves a given set of documents. In its abstract form, this can be seen as a “compression” of the document set to a short query. But the task also has a real-world application: cluster labeling (e.g., for faceted search). Our solution to cluster labeling is the usage of queries that approximately retrieve a cluster’s documents. To be generalizable, our approach does not require access to a search index but only a public interface like an API. This way, our approach can also be implemented at client side. In an experimental evaluation, a basic version of our approach using a simple retrieval model is on par with standard cluster labeling techniques. A further user study reveals that queries as labels are often preferred when they are not too long.

Cite

CITATION STYLE

APA

Hagen, M., Michel, M., & Stein, B. (2015). What was the query? Generating queries for document sets with applications in cluster labeling. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9103, pp. 124–133). Springer Verlag. https://doi.org/10.1007/978-3-319-19581-0_10

What was the query? Generating queries for document sets with applications in cluster labeling

Abstract

Cite

Register to see more suggestions