Please use this identifier to cite or link to this item:
http://hdl.handle.net/2445/200102
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Duran Frigola, Miquel | - |
dc.contributor.advisor | Vitrià i Marca, Jordi | - |
dc.contributor.author | Torre García, Marcos de la | - |
dc.date.accessioned | 2023-06-29T09:32:09Z | - |
dc.date.available | 2023-06-29T09:32:09Z | - |
dc.date.issued | 2023-01-15 | - |
dc.identifier.uri | http://hdl.handle.net/2445/200102 | - |
dc.description | Treballs finals del Màster de Fonaments de Ciència de Dades, Facultat de matemàtiques, Universitat de Barcelona. Curs: 2022-2023. Tutor: Miquel Duran Frigola i Jordi Vitrià i Marca | ca |
dc.description.abstract | [en] Predictive modelling of antimicrobial activity of molecules is a crucial step towards the discovery of anti-infective medicines. Unfortunately, there is a shortage of models covering endemic pathogens of the Global South, reflecting the existing bias in research towards diseases prevalent in wealthy countries. This project has developed a pipeline to systematically build drug discovery models, in particular antimicrobial activity prediction models for small molecule compounds. The data of assay results on a selected pathogen is extracted from a publicly available database: ChEMBL. This data is then cleaned and processed in order to build predictive models with various Automated Machine Learning (AutoML) techniques using the ZairaChem tool from the Ersilia Open Data Initiative. The pipeline has been applied on 6 pathogens of great relevance to global health known as ESKAPE, for which the data has been obtained and processed, and baseline models created. We have built the full set of final models for one of these pathogens, Staphylococcus aureus. The pipeline can be used on any other pathogen for which ChEMBL has sufficient data. This pipeline will be used to deploy models in the Ersilia Model Hub, a repository of pre-trained ML for drug discovery in global health. This will be an opportunity to compensate for the shortage of ML models adapted to the needs of the Global South. | ca |
dc.format.extent | 41 p. | - |
dc.format.mimetype | application/pdf | - |
dc.language.iso | eng | ca |
dc.rights | cc-by-nc-nd (c) Marcos de la Torre García, 2023 | - |
dc.rights | codi: GPL (c) Marcos de la Torre García, 2023 | - |
dc.rights.uri | http://www.gnu.org/licenses/gpl-3.0.ca.html | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ | * |
dc.source | Màster Oficial - Fonaments de la Ciència de Dades | - |
dc.subject.classification | Teoria de la predicció | - |
dc.subject.classification | Disseny de medicaments | - |
dc.subject.classification | Bacteris patògens | - |
dc.subject.classification | Treballs de fi de màster | - |
dc.subject.classification | Aprenentatge automàtic | ca |
dc.subject.other | Prediction theory | - |
dc.subject.other | Drug design | - |
dc.subject.other | Pathogenic bacteria | - |
dc.subject.other | Master's theses | - |
dc.subject.other | Machine learning | en |
dc.title | Applying AutoML techniques in drug discovery: systematic modelling of antimicrobial drug activity on a wide spectrum of pathogens | ca |
dc.type | info:eu-repo/semantics/masterThesis | ca |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | ca |
Appears in Collections: | Programari - Treballs de l'alumnat Màster Oficial - Fonaments de la Ciència de Dades |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
tfm_torre_garcia_marcos_de_la.pdf | Memòria | 3.56 MB | Adobe PDF | View/Open |
antimicrobial-ml-tasks-main.zip | Codi font | 15.44 MB | zip | View/Open |
This item is licensed under a Creative Commons License