On the use of the descriptive variable for enhancing the aggregation of crowdsourced labels

dc.contributor.authorBeñaran-Muñoz, Iker
dc.contributor.authorHernández-González, Jerónimo
dc.contributor.authorPérez, Aritz
dc.date.accessioned2022-10-03T09:02:56Z
dc.date.available2022-10-03T09:02:56Z
dc.date.issued2022-09-30
dc.date.updated2022-10-03T09:02:57Z
dc.description.abstractThe use of crowdsourcing for annotating data has become a popular and cheap alternative to expert labelling. As a consequence, an aggregation task is required to combine the different labels provided and agree on a single one per example. Most aggregation techniques, including the simple and robust majority voting¿to select the label with the largest number of votes¿disregard the descriptive information provided by the explanatory variable. In this paper, we propose domain-aware voting, an extension of majority voting which incorporates the descriptive variable and the rest of the instances of the dataset for aggregating the label of every instance. The experimental results with simulated and real-world crowdsourced data suggest that domain-aware voting is a competitive alternative to majority voting, especially when a part of the dataset is unlabelled. We elaborate on practical criteria for the use of domain-aware voting.
dc.format.mimetypeapplication/pdf
dc.identifier.idgrec725388
dc.identifier.issn0219-1377
dc.identifier.urihttps://hdl.handle.net/2445/189541
dc.language.isoeng
dc.publisherSpringer Verlag
dc.relation.isformatofReproducció del document publicat a: https://doi.org/10.1007/s10115-022-01743-z
dc.relation.ispartofKnowledge and Information Systems, 2022
dc.relation.urihttps://doi.org/10.1007/s10115-022-01743-z
dc.rightscc by (c) Iker Beñaran-Muñoz, et al., 2022
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/es/*
dc.sourceArticles publicats en revistes (Matemàtiques i Informàtica)
dc.subject.classificationAprenentatge automàtic
dc.subject.classificationCultura participativa
dc.subject.classificationDades massives
dc.subject.otherMachine learning
dc.subject.otherParticipatory culture
dc.subject.otherBig data
dc.titleOn the use of the descriptive variable for enhancing the aggregation of crowdsourced labels
dc.typeinfo:eu-repo/semantics/article
dc.typeinfo:eu-repo/semantics/publishedVersion

Fitxers

Paquet original

Mostrant 1 - 1 de 1
Carregant...
Miniatura
Nom:
725388.pdf
Mida:
1.71 MB
Format:
Adobe Portable Document Format