NeuroClean: multipurpose neural data preprocessing pipeline

Hernández Alonso, Manuel Andrés

Please use this identifier to cite or link to this item: https://hdl.handle.net/2445/200773

Title:	NeuroClean: multipurpose neural data preprocessing pipeline
Author:	Hernández Alonso, Manuel Andrés
Director/Tutor:	Cos Aguilera, Ignasi DePass , Michael
Keywords:	Electroencefalografia Processament de dades Programari Treballs de fi de grau Aprenentatge automàtic Electroencephalography Data processing Computer software Machine learning Bachelor's theses
Issue Date:	12-Jun-2023
Abstract:	[en] Electroencephalography (EEG) and Local field potentials (LFP) are two commonly used measures of electrical activity in the brain. These signals are used extensively in both industry and research and have many real world applications. Before any analyses can be performed on EEG/LFP, however, the data must first be cleaned. The main objective of this project was to create an unsupervised, multipurpose EEG/LFP preprocessing pipeline. Its unsupervised nature would, consequently, help alleviate problems involving reproducibility and biases that arise from human intervention. Moreover, manual signal cleaning is time and labor intensive. The adoption of an automated workflow would, therefore, save researchers valuable time and resources. A secondary goal was to allow the pipeline to be fit to several use cases, thus standardizing the cleaning methods used in neuroscience. We designed an automated EEG/LFP preprocessing pipeline, NeuroClean, which consists of five steps: bandpass filtering, line noise filtering, bad channel rejection, and independent component analysis with automatic component rejection based on a clustering algorithm. Machine learning classifiers were used to ensure task-relevant signals were preserved after each step of the cleaning process. We used an LFP dataset recorded from a cynomolgus macaque to validate the pipeline. Data was recorded while the monkey performed a reach-to-grasp task, and three sections of the movement were used for classification. NeuroClean appeared to remove several common types of artifacts from the signal. Moreover, it yielded over 97% accuracy (whereas chance-level is 33.3%) in an optimized Multinomial Logistic Regression model after cleaning the data, compared to the raw data which performed at 74% accuracy. The results show that NeuroClean is a promising pipeline and workflow that may be explored in the future.
Note:	Treballs Finals de Grau d'Enginyeria Informàtica, Facultat de Matemàtiques, Universitat de Barcelona, Any: 2023, Director: Ignasi Cos Aguilera i Michael DePass
URI:	https://hdl.handle.net/2445/200773
Appears in Collections:	Programari - Treballs de l'alumnat Treballs Finals de Grau (TFG) - Enginyeria Informàtica

Files in This Item:

File	Description	Size	Format
tfg_hernandez_alonso_manuel_andres.pdf	Memòria	4.52 MB	Adobe PDF	View/Open
codi.zip	Codi font	20.49 kB	zip	View/Open

Show full item record

This item is licensed under a Creative Commons License