Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk

dc.contributor.authorDimopoulos, Alexandros C.
dc.contributor.authorNikolaidou, Mara
dc.contributor.authorCaballero, Francisco Félix
dc.contributor.authorEngchuan, Worrawat
dc.contributor.authorSánchez Niubò, Albert
dc.contributor.authorArndt, Holger
dc.contributor.authorAyuso Mateos, José Luis
dc.contributor.authorHaro Abad, Josep Maria
dc.contributor.authorChatterji, Somnath
dc.contributor.authorGeorgousopoulou, Ekavi N.
dc.contributor.authorPitsavos, Christos
dc.contributor.authorPanagiotakos, Demosthenes B.
dc.date.accessioned2021-05-06T21:07:23Z
dc.date.available2021-05-06T21:07:23Z
dc.date.issued2018-12-29
dc.date.updated2021-05-06T21:07:24Z
dc.description.abstractBACKGROUND: The use of Cardiovascular Disease (CVD) risk estimation scores in primary prevention has long been established. However, their performance still remains a matter of concern. The aim of this study was to explore the potential of using ML methodologies on CVD prediction, especially compared to established risk tool, the HellenicSCORE. METHODS: Data from the ATTICA prospective study (n = 2020 adults), enrolled during 2001-02 and followed-up in 2011-12 were used. Three different machine-learning classifiers (k-NN, random forest, and decision tree) were trained and evaluated against 10-year CVD incidence, in comparison with the HellenicSCORE tool (a calibration of the ESC SCORE). Training datasets, consisting from 16 variables to only 5 variables, were chosen, with or without bootstrapping, in an attempt to achieve the best overall performance for the machine learning classifiers. RESULTS: Depending on the classifier and the training dataset the outcome varied in efficiency but was comparable between the two methodological approaches. In particular, the HellenicSCORE showed accuracy 85%, specificity 20%, sensitivity 97%, positive predictive value 87%, and negative predictive value 58%, whereas for the machine learning methodologies, accuracy ranged from 65 to 84%, specificity from 46 to 56%, sensitivity from 67 to 89%, positive predictive value from 89 to 91%, and negative predictive value from 24 to 45%; random forest gave the best results, while the k-NN gave the poorest results. CONCLUSIONS: The alternative approach of machine learning classification produced results comparable to that of risk prediction scores and, thus, it can be used as a method of CVD prediction, taking into consideration the advantages that machine learning methodologies may offer.
dc.format.extent11 p.
dc.format.mimetypeapplication/pdf
dc.identifier.idgrec686837
dc.identifier.issn1471-2288
dc.identifier.pmid30594138
dc.identifier.urihttps://hdl.handle.net/2445/177092
dc.language.isoeng
dc.publisherBioMed Central
dc.relation.isformatofReproducció del document publicat a: https://doi.org/10.1186/s12874-018-0644-1
dc.relation.ispartofBMC Medical Research Methodology, 2018, vol. 18, num. 1, p. 179
dc.relation.projectIDinfo:eu-repo/grantAgreement/EC/H2020/635316/EU//ATHLOS
dc.relation.urihttps://doi.org/10.1186/s12874-018-0644-1
dc.rightscc-by (c) Dimopoulos, Alexandros C. et al., 2018
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/es
dc.sourceArticles publicats en revistes (Medicina)
dc.subject.classificationMalalties cardiovasculars
dc.subject.classificationAprenentatge automàtic
dc.subject.otherCardiovascular diseases
dc.subject.otherMachine learning
dc.titleMachine learning methodologies versus cardiovascular risk scores, in predicting disease risk
dc.typeinfo:eu-repo/semantics/article
dc.typeinfo:eu-repo/semantics/publishedVersion

Fitxers

Paquet original

Mostrant 1 - 1 de 1
Carregant...
Miniatura
Nom:
686837.pdf
Mida:
1011.83 KB
Format:
Adobe Portable Document Format