While many statistical consensus methods now exist, the relative lack of comparative benchmarking and integration of techniques has made it increasingly difficult to determine the current state-of-the-art, to evaluate the relative benefit of new methods, to understand where specific problems merit greater attention, and to measure field progress over time. To make such comparative evaluation easier for everyone, we present SQUARE, an open source shared task framework including benchmark datasets, defined tasks, standard metrics, and reference implementations with empirical results for several popular methods. In addition to measuring performance on a variety of public, real crowd datasets, the benchmark varies supervision and noise by manipulating training size and labeling error. We envision SQUARE as dynamic and continually evolving, with new datasets and reference implementations added according to community needs and interest. We invite community contributions and participation.
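To make the benchmark setting concrete, the sketch below illustrates one of the simplest consensus baselines that frameworks like SQUARE evaluate, majority voting, scored against gold labels while labeling error is artificially varied. This is a minimal illustrative Python sketch, not SQUARE's reference implementation; the function names (majority_vote, inject_noise) and the toy data are hypothetical.

```python
import random
from collections import Counter, defaultdict

def majority_vote(labels_by_item):
    """Aggregate each item's worker labels by simple majority vote (ties broken arbitrarily)."""
    return {item: Counter(labels).most_common(1)[0][0]
            for item, labels in labels_by_item.items()}

def accuracy(predicted, gold):
    """Fraction of items whose consensus label matches the gold label."""
    return sum(predicted[i] == gold[i] for i in gold) / len(gold)

def inject_noise(labels_by_item, error_rate, label_set, seed=0):
    """Flip each worker label to a different random label with probability error_rate,
    simulating increased labeling error."""
    rng = random.Random(seed)
    noisy = defaultdict(list)
    for item, labels in labels_by_item.items():
        for lab in labels:
            if rng.random() < error_rate:
                lab = rng.choice([l for l in label_set if l != lab])
            noisy[item].append(lab)
    return noisy

# Toy example: three items, binary labels, three worker judgments per item.
labels_by_item = {"q1": [1, 1, 0], "q2": [0, 0, 0], "q3": [1, 0, 1]}
gold = {"q1": 1, "q2": 0, "q3": 1}

for rate in (0.0, 0.2, 0.4):
    noisy = inject_noise(labels_by_item, rate, label_set=[0, 1])
    print(f"error_rate={rate:.1f}  accuracy={accuracy(majority_vote(noisy), gold):.2f}")
```

In a full benchmark run, the same evaluation loop would be repeated for each dataset and each consensus method (e.g., Dawid-Skene or GLAD in place of majority voting), with training size and noise level swept to produce comparable curves.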
Citation:
Sheshadri, A., & Lease, M. (2013). SQUARE: A Benchmark for Research on Computing Crowd Consensus. In Proceedings of the 1st AAAI Conference on Human Computation and Crowdsourcing, HCOMP 2013 (pp. 156–164). AAAI Press. https://doi.org/10.1609/hcomp.v1i1.13088