Please use this identifier to cite or link to this item:
Title: Machine learning from crowds using candidate set-based labelling
Author: Beñaran-Muñoz, Iker
Hernández-González, Jerónimo
Pérez, Aritz
Keywords: Aprenentatge automàtic
Cultura participativa
Dades massives
Machine learning
Participatory culture
Big data
Issue Date: 8-Sep-2022
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Abstract: Crowdsourcing is a popular cheap alternative in machine learning for gathering information from a set of annotators. Learning from crowd-labelled data involves dealing with its inherent uncertainty and inconsistencies. In the classical framework, each annotator provides a single label per example, which fails to capture the complete knowledge of annotators. We propose candidate labelling, that is, to allow annotators to provide a set of candidate labels for each example and thus express their doubts. We propose an appropriate model for the annotators, and present two novel learning methods that deal with the two basic steps (label aggregation and model learning) sequentially or jointly. Our empirical study shows the advantage of candidate labelling and the proposed methods with respect to the classical framework.
Note: Reproducció del document publicat a:
It is part of: IEEE Intelligent Systems, 2022
Related resource:
ISSN: 1541-1672
Appears in Collections:Articles publicats en revistes (Matemàtiques i Informàtica)

Files in This Item:
File Description SizeFormat 
725385.pdf661.1 kBAdobe PDFView/Open

This item is licensed under a Creative Commons License Creative Commons