DataSift: An Expressive and Accurate Crowd-Powered Search Toolkit

13Citations
Citations of this article
25Readers
Mendeley users who have this article in their library.

Abstract

Traditional information retrieval systems have limited functionality. For instance, they are not able to adequately support queries containing non-textual fragments such as images or videos, queries that are very long or ambiguous, or semantically-rich queries over non-textual corpora. In this paper, we present DataSift, an expressive and accurate crowd-powered search toolkit that can connect to any corpus. We provide a number of alternative configurations for DataSift using crowdsourced and automated components, and demonstrate gains of 2–3x on precision over traditional retrieval schemes using experiments on real corpora. We also present our results on determining suitable values for parameters in those configurations, along with a number of interesting insights learned along the way.

Cite

CITATION STYLE

APA

Parameswaran, A., Teh, M. H., Garcia-Molina, H., & Widom, J. (2013). DataSift: An Expressive and Accurate Crowd-Powered Search Toolkit. In Proceedings of the 1st AAAI Conference on Human Computation and Crowdsourcing, HCOMP 2013 (pp. 112–120). AAAI Press. https://doi.org/10.1609/hcomp.v1i1.13077

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free