Filtering with the Crowd: CrowdScreen revisited (technical report)

Benoît Groz; Ezra Levin; Isaac Meilijson; Tova Milo

doi:10.4230/LIPIcs.ICDT.2016.12

Communication Dans Un Congrès Année : 2016

Filtering with the Crowd: CrowdScreen revisited (technical report)

(1, 2) , (1) , (3) , (1)

1
2
3

Benoît Groz

Fonction : Auteur
PersonId : 3136
IdHAL : benoit-groz
ORCID : 0000-0001-7292-6409
IdRef : 169343316

School of Computer Science

Données et Connaissances Massives et Hétérogènes (LRI)

Ezra Levin

Fonction : Auteur

School of Computer Science

Isaac Meilijson

Fonction : Auteur

School of Mathematical Sciences [Tel Aviv]

Tova Milo

Fonction : Auteur

School of Computer Science

Résumé

Filtering a set of items, based on a set of properties that can be verified by humans, is a common application of CrowdSourcing. When the workers are error-prone, each item is presented to multiple users, to limit the probability of misclassification. Since the Crowd is a relatively expensive resource, minimizing the number of questions per item may naturally result in big savings. Several algorithms to address this minimization problem have been presented in the CrowdScreen framework by Parameswaran et al. However, those algorithms do not scale well and therefore cannot be used in scenarios where high accuracy is required in spite of high user error rates. The goal of this paper is thus to devise algorithms that can cope with such situations. To achieve this, we provide new theoretical insights to the problem, then use them to develop a new efficient algorithm. We also propose novel optimizations for the algorithms of CrowdScreen that improve their scalability. We complement our theoretical study by an experimental evaluation of the algorithms on a large set of synthetic parameters as well as real-life crowdsourcing scenarios, demonstrating the advantages of our solution.

Domaines

Base de données [cs.DB] Algorithme et structure de données [cs.DS]

Fichier principal

glmm-icdt16-long.pdf (992.73 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Benoit Groz : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01239458

Soumis le : lundi 7 décembre 2015-18:02:05

Dernière modification le : mardi 13 février 2024-03:25:16

Archivage à long terme le : samedi 29 avril 2017-08:42:31

Dates et versions

hal-01239458 , version 1 (07-12-2015)

Licence

Paternité

Identifiants

HAL Id : hal-01239458 , version 1
DOI : 10.4230/LIPIcs.ICDT.2016.12

Citer

Benoît Groz, Ezra Levin, Isaac Meilijson, Tova Milo. Filtering with the Crowd: CrowdScreen revisited (technical report). 19th International Conference on Database Theory (ICDT 2016), Mar 2016, Bordeaux, France. pp.12:1--12:18, ⟨10.4230/LIPIcs.ICDT.2016.12⟩. ⟨hal-01239458⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UMR8623 CENTRALESUPELEC LRI-LAHDAK UNIV-PARIS-SACLAY LISN GS-ENGINEERING GS-COMPUTER-SCIENCE LISN-LAHDAK

505 Consultations

173 Téléchargements

Filtering with the Crowd: CrowdScreen revisited (technical report)

Résumé

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager