Please use this identifier to cite or link to this item:
http://hdl.handle.net/2445/201714
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Radeva, Petia | - |
dc.contributor.advisor | Rodrigues Sepúlveda Marques, Ricardo Jorge | - |
dc.contributor.author | Puriy Puriy, Nazar | - |
dc.date.accessioned | 2023-09-06T09:33:35Z | - |
dc.date.available | 2023-09-06T09:33:35Z | - |
dc.date.issued | 2023-06-13 | - |
dc.identifier.uri | http://hdl.handle.net/2445/201714 | - |
dc.description | Treballs Finals de Grau d'Enginyeria Informàtica, Facultat de Matemàtiques, Universitat de Barcelona, Any: 2023, Director: Petia Radeva i Ricardo Jorge Rodrigues Sepúlveda Marques | ca |
dc.description.abstract | [en] The field of 3D generation, a rapidly emerging domain within generative AI, holds immense potential for various applications in fields such as architecture, product design, marketing, entertainment, and even in the novel realm of virtual reality. Enhancing 3D technologies bears significant utility in fostering society development and serves as a captivating and intellectually stimulating field of study, offering intriguing challenges and opportunities for innovative advancements. In this dissertation, we introduce DreamText, an innovative Image to 3D generative model that harnesses text descriptors as an intermediate step for 3D reconstruction. Our proposed method effectively learns to describe objects within images, capturing crucial object details while disregarding extraneous contextual information such as lighting, point of view, or specific arrangements. This learned information serves as the foundation for generating compelling and novel views of the object, subsequently facilitating the creation of a comprehensive and accurate 3D reconstruction. Remarkably, our approach achieves high quality results, surpassing current state-of-the-art methodologies like RealFusion (CVPR2023)[1] in several test cases. Concrete evidence of our results can be observed in the following link. Furthermore, we present FitFusion, which leverages the knowledge of a pretrained image generative model, Stable Diffusion, to train a Neural Radiance Field capable of generating 3D models when provided with image data during training. This concept stems from a comprehensive analysis and understanding of a previous model called Stable DreamFusion[2], combined with meticulous parameter tuning that culminates in improved outcomes. This project entails extensive mathematical and experimental analysis of cutting-edge models, encompassing a comprehensive understanding of their intricate details. | ca |
dc.format.extent | 103 p. | - |
dc.format.mimetype | application/pdf | - |
dc.language.iso | eng | ca |
dc.rights | memòria: cc-nc-nd (c) Nazar Puriy Puriy, 2023 | - |
dc.rights | codi: Apache (c) Nazar Puriy Puriy, 2023 | - |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ | - |
dc.rights.uri | https://www.apache.org/licenses/LICENSE-2.0 | * |
dc.source | Treballs Finals de Grau (TFG) - Enginyeria Informàtica | - |
dc.subject.classification | Visualització tridimensional | ca |
dc.subject.classification | Intel·ligència artificial | ca |
dc.subject.classification | Programari | ca |
dc.subject.classification | Treballs de fi de grau | ca |
dc.subject.classification | Aprenentatge automàtic | ca |
dc.subject.other | Three-dimensional display systems | en |
dc.subject.other | Artificial intelligence | en |
dc.subject.other | Computer software | en |
dc.subject.other | Machine learning | en |
dc.subject.other | Bachelor's theses | en |
dc.title | DreamText: Harnessing text descriptions as an intermediate step for 3D reconstruction | ca |
dc.type | info:eu-repo/semantics/bachelorThesis | ca |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | ca |
Appears in Collections: | Programari - Treballs de l'alumnat Treballs Finals de Grau (TFG) - Enginyeria Informàtica |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
tfg_puriy_puriy_nazar.pdf | Memòria | 81.41 MB | Adobe PDF | View/Open |
DreamText.zip | Codi font | 3.6 GB | zip | View/Open |
This item is licensed under a Creative Commons License