Please use this identifier to cite or link to this item: http://hdl.handle.net/2445/186169
Title: Exploring self-supervised learning in deep learning for food recognition
Author: Ballús Riu, Nil
Director/Tutor: Radeva, Petia
Haro, Àlex
Keywords: Aprenentatge automàtic
Treballs de fi de grau
Visió per ordinador
Aliments
Sistemes classificadors (Intel·ligència artificial)
Machine learning
Bachelor's theses
Computer vision
Food
Learning classifier systems
Issue Date: 24-Jan-2022
Abstract: [en] Deep Learning is a field that evolves at dizzying rates and that allows obtaining extraordinary results in many applications of scientific and, even, everyday scope. In the field of computer vision, and especially in the food image recognition task, Machine Learning allows us to achieve accuracies close to human capacity. However, Machine Learning-based models often require a large number of labeled example images to carry out the learning process. Although we currently have millions of images, for example from the Internet or social networks, the main problem is that the process of labeling images is very expensive, difficult, and sometimes impossible or unfeasible. Therefore, the question we ask ourselves is: can we take advantage of large volumes of unlabeled images to help solving the problem of Supervised Learning? To answer the above question, in the project we have set out to explore the newest Self-supervised Learning methods to test whether the use of unlabeled images can help a food classifier improve its accuracy. Once analyzed in detail, we have proposed a new Self-Supervised Learning method, called OptSSL, which shows great performance in representing food images in a latent space. To demonstrate its potential, we have made an exhaustive comparison of this method with other recent Self-supervised Learning methods proposed in the literature. Another problem discussed in this paper is the modeling of the uncertainty of the predictions of a model. Most current Deep Learning models are not able to decide whether their predictions are confident or not. Thus, in the project, this property is incorporated into the analyzed classifiers, and the performance of the classifier is analyzed not only from the point of view of its accuracy but also from its uncertainty, making a comparison of its robustness in this context. As a result of the work, we show that OptSSL is an optimal method for data representation, and in particular of food images, which benefits from unlabeled images to learn the representation. Finally, we show how the OptSSL method can be used as a pre-training phase to subsequently train a food classifier, obtaining high accuracy and low uncertainty in predictions.
Note: Treballs Finals de Grau de Matemàtiques, Facultat de Matemàtiques, Universitat de Barcelona, Any: 2022, Director: Petia Radeva i Àlex Haro
URI: http://hdl.handle.net/2445/186169
Appears in Collections:Treballs Finals de Grau (TFG) - Enginyeria Informàtica
Treballs Finals de Grau (TFG) - Matemàtiques
Programari - Treballs de l'alumnat

Files in This Item:
File Description SizeFormat 
codi.zipCodi font359.7 kBzipView/Open
tfg_ballus_riu_nil.pdfMemòria18.46 MBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons