In this tutorial, we introduce a novel crowdsourcing methodology called CrowdTruth [1, 9]. The central characteristic of CrowdTruth is harnessing the diversity in human interpretation to capture the wide range of opinions and perspectives, and thus provide more reliable, realistic and inclusive real-world annotated data for training and evaluating machine learning components. Unlike other methods, we do not discard dissenting votes, but incorporate them into a richer and more continuous representation of truth. CrowdTruth is a widely used crowdsourcing methodology1 adopted by industrial partners and public organizations such as Google, IBM, New York Times, Cleveland Clinic, Crowdynews, Sound and Vision archive, Rijksmuseum, and in a multitude of domains such as AI, news, medicine, social media, cultural heritage, and social sciences. The goal of this tutorial is to introduce the audience to a novel approach to crowdsourcing that takes advantage of the diversity of opinions and perspectives that is inherent to the Web, as methods that deal with disagreement and diversity in crowdsourcing have become increasingly popular. Creating this more complex notion of truth contributes directly to the larger discussion on how to make the Web more reliable, diverse and inclusive.
CITATION STYLE
Aroyo, L., Szlávik, Z., Dumitrache, A., Timmermans, B., Inel, O., & Welty, C. (2019). Crowdsourcing inclusivity: Dealing with diversity of opinions, perspectives and ambiguity in annotated data: The Crowdtruth tutorial. In The Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019 (pp. 1294–1295). Association for Computing Machinery, Inc. https://doi.org/10.1145/3308558.3320096
Mendeley helps you to discover research relevant for your work.