Comparative analysis of open source large language models

dc.contributor.advisorOrtiz Martínez, Daniel
dc.contributor.advisorArpírez Vega, Julio César
dc.contributor.authorFayos i Pérez, Victor
dc.date.accessioned2024-09-16T06:53:43Z
dc.date.available2024-09-16T06:53:43Z
dc.date.issued2024-06-30
dc.descriptionTreballs finals del Màster de Fonaments de Ciència de Dades, Facultat de matemàtiques, Universitat de Barcelona. Curs: 2023-2024. Tutor: Daniel Ortiz Martínezca
dc.description.abstract[en] This study investigates the potential of using smaller, locally hosted language models (LLMs) to perform specific tasks traditionally handled by large LLMs, such as OpenAI’s Chat-GPT 3.5. With the growing integration of LLMs in corporate environments, concerns over costs, data privacy, and security have become prominent. By focusing on question answering and text summarization tasks, we compare the performance of several smaller models, including Flan T5 XXL, Phi 3 Mini, and Yi 1.5, against Chat-GPT 3.5. As the two experiments show, one on question answering and the second one on text summarization, this tasks can be done by the tested models at the same level than the state of the art Chat-GPT 3.5. Concluding that depending the use intended for the LLM one of the different models could best fit as the variety in the response structure and verbosity highly depends on the model selected.ca
dc.format.extent35 p.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/2445/215162
dc.language.isoengca
dc.rightscc-by-nc-nd (c) Victor Fayos i Pérez, 2024
dc.rightscodi: GPL (c) Victor Fayos i Pérez, 2024
dc.rights.accessRightsinfo:eu-repo/semantics/openAccessca
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/*
dc.rights.urihttp://www.gnu.org/licenses/gpl-3.0.ca.html*
dc.sourceMàster Oficial - Fonaments de la Ciència de Dades
dc.subject.classificationTractament del llenguatge natural (Informàtica)
dc.subject.classificationSistemes informàtics interactius
dc.subject.classificationBots (Programes d'ordinador)
dc.subject.classificationTreballs de fi de màster
dc.subject.otherNatural language processing (Computer science)
dc.subject.otherInteractive computer systems
dc.subject.otherInternet bots (Computer software)
dc.subject.otherMaster's thesis
dc.titleComparative analysis of open source large language modelsca
dc.typeinfo:eu-repo/semantics/masterThesisca

Fitxers

Paquet original

Mostrant 1 - 2 de 2
Carregant...
Miniatura
Nom:
tfm_fayos_victor.pdf
Mida:
5.58 MB
Format:
Adobe Portable Document Format
Descripció:
Memòria
Carregant...
Miniatura
Nom:
codi_font.zip
Mida:
16.79 MB
Format:
ZIP file
Descripció:
Codi font