Please use this identifier to cite or link to this item:
http://hdl.handle.net/2445/132906
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Bolaños Solà, Marc | - |
dc.contributor.author | Valdivia Arriaza, Marc | - |
dc.date.accessioned | 2019-05-09T08:22:29Z | - |
dc.date.available | 2019-05-09T08:22:29Z | - |
dc.date.issued | 2018-06-27 | - |
dc.identifier.uri | http://hdl.handle.net/2445/132906 | - |
dc.description | Final degree project in Computer Engineering, Facultat de Matemàtiques, Universitat de Barcelona, Year: 2018, Advisor: Marc Bolaños Solà | ca |
dc.description.abstract | [en] Food has become a very important aspect of our social activities. Since social networks and websites like Yelp appeared, their users have been uploading photos of their meals to the Internet. This has driven the development of food analysis and food recognition models. We propose a model that recognizes the meal appearing in a picture from a list of menu items (candidate dishes), which could serve to recognize the meal selected in a restaurant. In a real-case scenario, the system presented in this thesis does not need to train a new model for every new restaurant. It learns to identify the components of an image and the relationship they have with the name of the meal. The system introduced in this work computes the similarity between an image and a text sequence that represents the name of the dish. The pictures are encoded using a combination of Convolutional Neural Networks that reduces the input image to a vector, while the text is converted to a single vector by a Long Short-Term Memory network. These two vectors are compared, and the model is optimized, using a similarity function. The similarity-based output is then used to rank the items in a menu list and find the most probable one. According to the Ranking Loss metric, the results obtained by the model improve on the baseline by 15%. | ca |
dc.format.extent | 65 p. | - |
dc.format.mimetype | application/pdf | - |
dc.language.iso | eng | ca |
dc.rights | report: cc-by-nc-nd (c) Marc Valdivia Arriaza, 2018 | - |
dc.rights | code: GPL (c) Marc Valdivia Arriaza, 2018 | - |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ | * |
dc.rights.uri | http://www.gnu.org/licenses/gpl-3.0.ca.html | * |
dc.source | Treballs Finals de Grau (TFG) - Enginyeria Informàtica | - |
dc.subject.classification | Processament digital d'imatges | ca |
dc.subject.classification | Visió per ordinador | ca |
dc.subject.classification | Programari | ca |
dc.subject.classification | Treballs de fi de grau | ca |
dc.subject.classification | Reconeixement de formes (Informàtica) | ca |
dc.subject.classification | Cartes (Restauració) | ca |
dc.subject.classification | Aprenentatge automàtic | ca |
dc.subject.other | Digital image processing | en |
dc.subject.other | Computer vision | en |
dc.subject.other | Computer software | en |
dc.subject.other | Pattern recognition systems | en |
dc.subject.other | Menus | en |
dc.subject.other | Bachelor's theses | en |
dc.subject.other | Machine learning | en |
dc.title | Where am I eating? Image-based food menu recognition | ca |
dc.type | info:eu-repo/semantics/bachelorThesis | ca |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | ca |
Appears in Collections: | Treballs Finals de Grau (TFG) - Enginyeria Informàtica; Programari - Treballs de l'alumnat |
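
The abstract describes ranking menu items by the similarity between an image vector and a dish-name vector. The following is a minimal sketch of that ranking step only, not the thesis implementation. It assumes cosine similarity (the record does not name the similarity function), and the vectors and dish names are hypothetical stand-ins for the outputs of the CNN image encoder and the LSTM text encoder.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (assumed metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as having no similarity
    return dot / (norm_a * norm_b)


def rank_menu(image_vec, menu_vecs):
    """Return (dish name, score) pairs sorted from most to least similar."""
    scored = [
        (name, cosine_similarity(image_vec, vec))
        for name, vec in menu_vecs.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical embeddings: in the described system these would come from
# the image encoder (CNNs) and the text encoder (LSTM) respectively.
image_vec = [0.9, 0.1, 0.3]
menu_vecs = {
    "paella": [0.8, 0.2, 0.4],
    "tiramisu": [0.1, 0.9, 0.2],
    "gazpacho": [0.3, 0.3, 0.9],
}

ranking = rank_menu(image_vec, menu_vecs)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Because the ranking depends only on the vectors, a new restaurant's menu can be scored by encoding its dish names, without training a new model, which matches the property the abstract highlights.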
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
codi_font.zip | Source code | 380.39 kB | zip | View/Open |
memoria.pdf | Thesis report | 4.61 MB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License