Formalizing the Problem of Learning with Imprecise Data

Díaz Acevedo, Ana

Formalizing the Problem of Learning with Imprecise Data

dc.contributor.advisor	Nagarajan, Bhalaji
dc.contributor.advisor	Radeva, Petia
dc.contributor.advisor	Haro, Àlex
dc.contributor.author	Díaz Acevedo, Ana
dc.date.accessioned	2026-02-24T12:33:36Z
dc.date.available	2026-02-24T12:33:36Z
dc.date.issued	2025-06-10
dc.description	Treballs Finals de Grau de Matemàtiques, Facultat de Matemàtiques, Universitat de Barcelona, Any: 2025, Director: Bhalaji Nagarajan, Petia Radeva i Àlex Haro
dc.description.abstract	Deep learning is a powerful tool for complex tasks including image classification, but its success heavily depends on the availability of high-quality, correctly labeled data. In practice, however, datasets often contain imprecise labels—annotations that are ambiguous, incomplete, or incorrect. This thesis addresses the central challenge of how to build reliable learning systems when the data they depend on cannot be fully trusted. At first, the thesis provides a rigorous mathematical formalization of the core concepts in machine learning, with a particular emphasis on deep learning frameworks. Building on this foundation, it then introduces and studies a specialized framework that models the learning process under imprecise labels, where the assumptions of standard supervised learning no longer hold. Through the lens of statistical modeling, we explore how uncertainty in labels can be incorporated into deep learning models, treating imprecision not as noise to ignore, but as a structure to model. A key contribution is showing how such a framework defines a parametric model amenable to inference techniques like Maximum Likelihood Estimation (MLE). The practical component of the thesis involves implementing the framework using real image datasets, with experiments designed to study how imprecise labels influence the learning process of deep networks. These results help identify strategies for mitigating the negative effects of label noise and contribute to building more robust and theoretically grounded learning systems. By bridging the gap between theoretical foundations and practical implementations, this work aims to deepen the understanding of learning under imprecision, which is critical for deploying deep learning models in real-world applications. The insights gained have broader implications beyond image classification, potentially benefiting various domains and tasks where data quality is a concern. Ultimately, this thesis seeks to pave the way for more reliable and interpretable machine learning models capable of handling the complexities of imperfect data.
dc.format.extent	53 p.
dc.format.mimetype	application/pdf
dc.identifier.uri	https://hdl.handle.net/2445/227311
dc.language.iso	eng
dc.rights	cc-by-nc-nd (c) Ana Díaz Acevedo, 2025
dc.rights.accessRights	info:eu-repo/semantics/openAccess
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/es
dc.source	Treballs Finals de Grau (TFG) - Matemàtiques
dc.subject.classification	Aprenentatge automàtic	ca
dc.subject.classification	Aprenentatge profund	ca
dc.subject.classification	Intel·ligència artificial	ca
dc.subject.classification	Ana Díaz Acevedo	ca
dc.subject.classification	Treballs de fi de grau	ca
dc.subject.other	Machine learning	en
dc.subject.other	Deep learning (Machine learning)	en
dc.subject.other	Artificial intelligence	en
dc.subject.other	Bachelor's theses	en
dc.title	Formalizing the Problem of Learning with Imprecise Data
dc.type	info:eu-repo/semantics/bachelorThesis

Fitxers

Paquet original

Mostrant 1 - 1 de 1

Nom:: TFG_Diaz_Acevedo_Ana.pdf
Mida:: 3.18 MB
Format:: Adobe Portable Document Format

Descarregar

Col·leccions

Treballs Finals de Grau (TFG) - Matemàtiques