Document type

Master's thesis

Publication license

cc-by-nc-nd (c) Margarida Gonçalves, 2023
Please always use this identifier to cite or link to this document: https://hdl.handle.net/2445/213366

Selection of predictors for peripheral arterial disease using tree-based algorithms

Abstract

The purpose of this thesis is to collaborate with clinicians in order to enhance knowledge of peripheral arterial disease (PAD) by using machine learning techniques to select the variables most strongly associated with PAD among a set of predictors from a recent cross-sectional medical study carried out in Barcelona (Gonçalves-Martins et al., 2021). We built several machine learning models using Random Forest, Gradient Boosting Tree, and Extreme Gradient Boosting classifiers to retrieve risk factors, of which Random Forest was the most efficient. Risk factors were obtained using the SHapley Additive exPlanations (SHAP) library. Results were compared with the known outcome of the logistic regression model used in Gonçalves-Martins et al., 2021. We were able to replicate the main results of that study, as well as to discover new nuances in the factors that play a role in the development of PAD. Consistent with the above-mentioned study, smoking was found to be a strong predictor of PAD in both women and men, whereas hypertension was a strong predictor in women and diabetes was a strong predictor in men. Surprisingly, dyslipidemia appeared to be negatively correlated with PAD. Furthermore, cholesterol levels and blood pressure levels could be unreliable for an analysis of risk factors for PAD, due to the effect of medication. Among our findings, we discovered that REGICOR scores are most consistent when their continuous value is used, and that a history of cardiovascular events is especially influential on PAD in men. In addition, abdominal perimeter proved to be more efficient than body mass index and obesity, in general but especially for women, both in predicting PAD and in discerning its risk factors.
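
As a minimal illustration of the Shapley attribution idea on which SHAP is based, the sketch below computes exact Shapley values for a hypothetical three-feature risk score. The function `toy_risk`, its coefficients, the all-zero baseline, and the example patient are assumptions invented for illustration. None of them come from the thesis, and the `shap` library itself is not used here.

```python
from itertools import combinations
from math import factorial

# Hypothetical binary risk factors (names chosen for illustration only).
FEATURES = ["smoking", "hypertension", "diabetes"]


def toy_risk(x):
    """Invented risk score; the coefficients are NOT estimates from the thesis."""
    score = 0.05
    score += 0.20 * x["smoking"]
    score += 0.10 * x["hypertension"]
    score += 0.10 * x["diabetes"]
    score += 0.05 * x["smoking"] * x["diabetes"]  # assumed interaction term
    return score


def shapley_values(model, x, baseline):
    """Exact Shapley values of model(x) relative to model(baseline).

    Features outside a coalition are set to their baseline value.
    The cost grows exponentially with the number of features, so this
    is only practical for very small feature sets.
    """
    n = len(FEATURES)

    def value(coalition):
        mixed = {f: (x[f] if f in coalition else baseline[f]) for f in FEATURES}
        return model(mixed)

    phi = {}
    for f in FEATURES:
        others = [g for g in FEATURES if g != f]
        total = 0.0
        for k in range(len(others) + 1):
            # Weight of every coalition of size k that excludes f.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                s = set(subset)
                total += weight * (value(s | {f}) - value(s))
        phi[f] = total
    return phi


# Hypothetical patient: smoker with diabetes, no hypertension.
patient = {"smoking": 1, "hypertension": 0, "diabetes": 1}
baseline = {f: 0 for f in FEATURES}
phi = shapley_values(toy_risk, patient, baseline)

# Rank features by absolute contribution, largest first.
ranking = sorted(phi, key=lambda f: abs(phi[f]), reverse=True)
```

In this toy setting the attributions sum to the difference between the patient's score and the baseline score, and a feature that equals its baseline value receives zero attribution. Tree-specific SHAP explainers compute comparable attributions far more efficiently for ensembles such as Random Forest.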

Description

Master's theses of the Master's in Fundamental Principles of Data Science, Faculty of Mathematics, Universitat de Barcelona. Academic year: 2022-2023. Supervisor: Carles Casacuberta

Citation


GONÇALVES, Margarida. Selection of predictors for peripheral arterial disease using tree-based algorithms. [accessed: 15 February 2026]. [Available at: https://hdl.handle.net/2445/213366]
