Title: Training machine learning models with synthetic data improves the prediction of ventricular origin in outflow tract ventricular arrhythmias
Author: Doste, Rubén
Lozano, Miguel
Jiménez Pérez, Guillermo
Mont Girbau, Lluís
Berruezo Sánchez, Antonio
Penela, Diego
Cámara, Óscar
Sebastián, Rafael
Keywords: Machine learning
Issue Date: 12-Aug-2022
Publisher: Frontiers
Abstract: To determine the site of origin (SOO) in outflow tract ventricular arrhythmias (OTVAs) before an ablation procedure, several algorithms based on manual identification of electrocardiogram (ECG) features have been developed. However, their reported accuracy decreases when they are tested on different datasets. Machine learning algorithms can automate the process and improve generalization, but their performance is hampered by the lack of sufficiently large OTVA databases. We propose using detailed electrophysiological simulations of OTVAs to train a machine learning classification model that predicts the ventricular origin of the SOO of ectopic beats. We generated a synthetic database of 12-lead ECGs (2,496 signals) by running multiple simulations from the most typical OTVA SOOs in 16 patient-specific geometries. Two types of input data were considered for classification: raw ECG signals and ECG features. From the simulated raw 12-lead ECGs, we analyzed the contribution of each lead to the predictions and kept the best ones for the training process. For the feature-based analysis, we used entropy-based methods to rank the obtained features. A cross-validation process was included to evaluate the machine learning model. Subsequently, two clinical OTVA databases from different hospitals, including ECGs from 365 patients, were used as test sets to assess the generalization of the proposed approach. The results show that V2 was the best lead for classification. Prediction of the SOO in OTVA, using either raw signals or features for classification, achieved high accuracy (>0.96). The network trained on simulated data generalized well to both patient datasets (accuracy of 0.86 and 0.84, respectively) and outperformed classification using exclusively real ECGs (accuracy of 0.84 and 0.76 for each dataset).
Training machine learning-based classification algorithms on simulated ECG data is critical to obtaining good SOO predictions in OTVA, compared with using real data alone. The fast implementation and generalization of the proposed methodology may contribute to its application in clinical routine. Copyright © 2022 Doste, Lozano, Jimenez-Perez, Mont, Berruezo, Penela, Camara and Sebastian.
Note: Reproduction of the document published at:
It is part of: Frontiers In Physiology, 2022, vol. 13
Related resource:
ISSN: 1664-042X
Appears in Collections: Articles published in journals (IDIBAPS: Institut d'investigacions Biomèdiques August Pi i Sunyer)

This item is licensed under a Creative Commons License.