Comparative analysis of open source large language models

Fayos i Pérez, Victor

Please use this identifier to cite or link to this item: https://hdl.handle.net/2445/215162

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Ortiz Martínez, Daniel	-
dc.contributor.advisor	Arpírez Vega, Julio César	-
dc.contributor.author	Fayos i Pérez, Victor	-
dc.date.accessioned	2024-09-16T06:53:43Z	-
dc.date.available	2024-09-16T06:53:43Z	-
dc.date.issued	2024-06-30	-
dc.identifier.uri	https://hdl.handle.net/2445/215162	-
dc.description	Treballs finals del Màster de Fonaments de Ciència de Dades, Facultat de matemàtiques, Universitat de Barcelona. Curs: 2023-2024. Tutor: Daniel Ortiz Martínez	ca
dc.description.abstract	[en] This study investigates the potential of using smaller, locally hosted language models (LLMs) to perform specific tasks traditionally handled by large LLMs, such as OpenAI’s Chat-GPT 3.5. With the growing integration of LLMs in corporate environments, concerns over costs, data privacy, and security have become prominent. By focusing on question answering and text summarization tasks, we compare the performance of several smaller models, including Flan T5 XXL, Phi 3 Mini, and Yi 1.5, against Chat-GPT 3.5. As the two experiments show, one on question answering and the second one on text summarization, this tasks can be done by the tested models at the same level than the state of the art Chat-GPT 3.5. Concluding that depending the use intended for the LLM one of the different models could best fit as the variety in the response structure and verbosity highly depends on the model selected.	ca
dc.format.extent	35 p.	-
dc.format.mimetype	application/pdf	-
dc.language.iso	eng	ca
dc.rights	cc-by-nc-nd (c) Victor Fayos i Pérez, 2024	-
dc.rights	codi: GPL (c) Victor Fayos i Pérez, 2024	-
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/es/	*
dc.rights.uri	http://www.gnu.org/licenses/gpl-3.0.ca.html	*
dc.source	Màster Oficial - Fonaments de la Ciència de Dades	-
dc.subject.classification	Tractament del llenguatge natural (Informàtica)	-
dc.subject.classification	Sistemes informàtics interactius	-
dc.subject.classification	Bots (Programes d'ordinador)	-
dc.subject.classification	Treballs de fi de màster	-
dc.subject.other	Natural language processing (Computer science)	-
dc.subject.other	Interactive computer systems	-
dc.subject.other	Internet bots (Computer software)	-
dc.subject.other	Master's thesis	-
dc.title	Comparative analysis of open source large language models	ca
dc.type	info:eu-repo/semantics/masterThesis	ca
dc.rights.accessRights	info:eu-repo/semantics/openAccess	ca
Appears in Collections:	Màster Oficial - Fonaments de la Ciència de Dades Programari - Treballs de l'alumnat

Files in This Item:

File	Description	Size	Format
tfm_fayos_victor.pdf	Memòria	5.72 MB	Adobe PDF	View/Open
codi_font.zip	Codi font	17.19 MB	zip	View/Open

Show simple item record

This item is licensed under a Creative Commons License