Please use this identifier to cite or link to this item:
http://hdl.handle.net/2445/173728
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Bolaños, Marc | - |
dc.contributor.advisor | Radeva, Petia | - |
dc.contributor.author | Peracaula Prat, Joan | - |
dc.date.accessioned | 2021-02-08T09:31:55Z | - |
dc.date.available | 2021-02-08T09:31:55Z | - |
dc.date.issued | 2020-09-13 | - |
dc.identifier.uri | http://hdl.handle.net/2445/173728 | - |
dc.description | Bachelor's Degree Final Projects (Treballs Finals de Grau) in Computer Engineering, Facultat de Matemàtiques, Universitat de Barcelona, Year: 2020, Advisors: Marc Bolaños and Petia Radeva | ca |
dc.description.abstract | [en] Food recognition, that is, object detection and classification applied to the food domain, is the main topic of this work. We study the problem of recognising food instances in tray images from self-service restaurants and propose a novel multimodal deep learning approach. Taking images and daily menus as input, the presented model combines two state-of-the-art models for object detection and classification with a multimodal neural network to make significantly refined predictions compared with the baseline object detection model, achieving a class-weighted average F1-score of 0.862. An ensemble model built from the proposed and baseline models, also presented in this work, further improves the results, achieving a class-weighted average F1-score of 0.877. | ca |
dc.format.extent | 81 p. | - |
dc.format.mimetype | application/pdf | - |
dc.language.iso | eng | ca |
dc.rights | report: cc-nc-nd (c) Joan Peracaula Prat, 2020 | - |
dc.rights | code: GPL (c) Joan Peracaula Prat, 2019 | - |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ | - |
dc.rights.uri | http://www.gnu.org/licenses/gpl-3.0.ca.html | * |
dc.source | Treballs Finals de Grau (TFG) - Enginyeria Informàtica | - |
dc.subject.classification | Xarxes neuronals (Informàtica) | ca |
dc.subject.classification | Aprenentatge automàtic | ca |
dc.subject.classification | Programari | ca |
dc.subject.classification | Treballs de fi de grau | ca |
dc.subject.classification | Processament digital d'imatges | ca |
dc.subject.classification | Visió per ordinador | ca |
dc.subject.classification | Aliments | ca |
dc.subject.other | Neural networks (Computer science) | en |
dc.subject.other | Machine learning | en |
dc.subject.other | Computer software | en |
dc.subject.other | Digital image processing | en |
dc.subject.other | Computer vision | en |
dc.subject.other | Bachelor's theses | en |
dc.subject.other | Food | en |
dc.title | A multimodal deep learning approach for food tray recognition | ca |
dc.type | info:eu-repo/semantics/bachelorThesis | ca |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | ca |
Appears in Collections: | Programari - Treballs de l'alumnat; Treballs Finals de Grau (TFG) - Matemàtiques; Treballs Finals de Grau (TFG) - Enginyeria Informàtica |
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
codi.zip | Source code | 3.91 MB | zip |
173728.pdf | Thesis report | 7.84 MB | Adobe PDF |
This item is licensed under a Creative Commons License
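
The abstract reports results as a class-weighted average F1-score. As a minimal sketch of how such a metric is commonly computed, the Python snippet below averages per-class F1 scores weighted by each class's number of true instances. This weighting scheme is an assumption (the record does not define it), the function name `weighted_f1` is hypothetical, and the labels are made-up illustrations rather than data from the thesis.

```python
from collections import Counter


def weighted_f1(y_true, y_pred):
    """Support-weighted mean of per-class F1 scores (assumed definition)."""
    support = Counter(y_true)  # number of true instances per class
    total = len(y_true)
    score = 0.0
    for cls, n in support.items():
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == cls and p == cls)
        fp = sum(1 for t, p in pairs if t != cls and p == cls)
        fn = n - tp
        denom = 2 * tp + fp + fn
        f1 = 2 * tp / denom if denom else 0.0
        score += (n / total) * f1  # weight each class by its support
    return score


# Hypothetical labels for illustration only, not results from the thesis.
y_true = ["rice", "rice", "soup", "salad"]
y_pred = ["rice", "soup", "soup", "salad"]
print(round(weighted_f1(y_true, y_pred), 3))
```

Weighting by support means frequent dishes influence the score more than rare ones. An unweighted (macro) average would treat every class equally.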