Formalizing the Problem of Learning with Imprecise Data

dc.contributor.advisorNagarajan, Bhalaji
dc.contributor.advisorRadeva, Petia
dc.contributor.advisorHaro, Àlex
dc.contributor.authorDíaz Acevedo, Ana
dc.date.accessioned2026-02-24T12:33:36Z
dc.date.available2026-02-24T12:33:36Z
dc.date.issued2025-06-10
dc.descriptionTreballs Finals de Grau de Matemàtiques, Facultat de Matemàtiques, Universitat de Barcelona, Any: 2025, Director: Bhalaji Nagarajan, Petia Radeva i Àlex Haro
dc.description.abstractDeep learning is a powerful tool for complex tasks including image classification, but its success heavily depends on the availability of high-quality, correctly labeled data. In practice, however, datasets often contain imprecise labels—annotations that are ambiguous, incomplete, or incorrect. This thesis addresses the central challenge of how to build reliable learning systems when the data they depend on cannot be fully trusted. At first, the thesis provides a rigorous mathematical formalization of the core concepts in machine learning, with a particular emphasis on deep learning frameworks. Building on this foundation, it then introduces and studies a specialized framework that models the learning process under imprecise labels, where the assumptions of standard supervised learning no longer hold. Through the lens of statistical modeling, we explore how uncertainty in labels can be incorporated into deep learning models, treating imprecision not as noise to ignore, but as a structure to model. A key contribution is showing how such a framework defines a parametric model amenable to inference techniques like Maximum Likelihood Estimation (MLE). The practical component of the thesis involves implementing the framework using real image datasets, with experiments designed to study how imprecise labels influence the learning process of deep networks. These results help identify strategies for mitigating the negative effects of label noise and contribute to building more robust and theoretically grounded learning systems. By bridging the gap between theoretical foundations and practical implementations, this work aims to deepen the understanding of learning under imprecision, which is critical for deploying deep learning models in real-world applications. The insights gained have broader implications beyond image classification, potentially benefiting various domains and tasks where data quality is a concern. Ultimately, this thesis seeks to pave the way for more reliable and interpretable machine learning models capable of handling the complexities of imperfect data.
dc.format.extent53 p.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/2445/227311
dc.language.isoeng
dc.rightscc-by-nc-nd (c) Ana Díaz Acevedo, 2025
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es
dc.sourceTreballs Finals de Grau (TFG) - Matemàtiques
dc.subject.classificationAprenentatge automàticca
dc.subject.classificationAprenentatge profundca
dc.subject.classificationIntel·ligència artificialca
dc.subject.classificationAna Díaz Acevedoca
dc.subject.classificationTreballs de fi de grauca
dc.subject.otherMachine learningen
dc.subject.otherDeep learning (Machine learning)en
dc.subject.otherArtificial intelligenceen
dc.subject.otherBachelor's thesesen
dc.titleFormalizing the Problem of Learning with Imprecise Data
dc.typeinfo:eu-repo/semantics/bachelorThesis

Fitxers

Paquet original

Mostrant 1 - 1 de 1
Carregant...
Miniatura
Nom:
TFG_Diaz_Acevedo_Ana.pdf
Mida:
3.18 MB
Format:
Adobe Portable Document Format