Comparative Study of Clustering Techniques for Hypnogram Analysis and User-Level Insights

dc.contributor.advisorSeguí Mesquida, Santi
dc.contributor.advisorBrull Martínez, María
dc.contributor.authorCasas Herce, Carmen
dc.date.accessioned2026-03-30T14:50:48Z
dc.date.available2026-03-30T14:50:48Z
dc.date.issued2026-01-16
dc.descriptionTreballs finals del Màster de Fonaments de Ciència de Dades, Facultat de matemàtiques, Universitat de Barcelona. Any: 2026. Tutor: Santi Seguí Mesquida i María Brull Martínez
dc.description.abstractThis thesis aims to develop an unsupervised clustering framework to identify patterns in sleep data recorded by wearable devices. The work compares different algorithms, focusing on distance metrics and feature representations tailored to categorical time series. Firstly, it presents a comparative review of the literature on sleep pattern clustering from polysomnography and wearable data. It summarizes common approaches, feature engineering and validation strategies, and analyses how these methods influence the choices made in this work. Secondly, six clustering algorithms are applied to the sleep data and evaluated using standard clustering scores and the evolution of inertia as the number of clusters increases, in order to assess both stability and interpretability. In particular, the k-modes baseline produces clusters that fail to capture clear differences in sleep patterns, while agglomerative clustering with Hamming distance applied to a distance matrix generates very distinctive but unbalanced groups. To obtain more stable and interpretable groups, K-means clustering is explored using both Dynamic Time Warping (although the algorithm is not designed for categorical data) on the full sequences and a compact feature-based representation including sleep efficiency, REM and deep sleep percentages, and the number of awakenings lasting longer than 5 minutes. Finally, feature-envelope approaches that summarize the temporal evolution of these features across the night are implemented, obtaining a higher quality clustering results and a better characterization of sleep patterns. The conclusions focus primarily on lower values of k, where clustering metrics indicate better performance, suggesting that the underlying structure of the data is more continuous than discrete.
dc.format.extent26 p.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/2445/228608
dc.language.isoeng
dc.rightsmemòria: cc-by-nc-nd (c) Carmen Casas Herce, 2026
dc.rightscodi: GPL (c) Carmen Casas Herce, 2026
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.rights.urihttp://www.gnu.org/licenses/gpl-3.0.ca.html
dc.sourceMàster Oficial - Fonaments de la Ciència de Dades
dc.subject.classificationTrastorns del son
dc.subject.classificationAlgorismes computacionals
dc.subject.classificationAnàlisi de conglomerats
dc.subject.classificationCarmen Casas Herce
dc.subject.classificationTreballs de fi de màster
dc.subject.otherSleep disorders
dc.subject.otherComputer algorithms
dc.subject.otherCluster analysis
dc.subject.otherMaster's thesis
dc.titleComparative Study of Clustering Techniques for Hypnogram Analysis and User-Level Insights
dc.typeinfo:eu-repo/semantics/masterThesis

Fitxers

Paquet original

Mostrant 1 - 2 de 2
Carregant...
Miniatura
Nom:
TFM_Casas_Herce_Carmen.pdf
Mida:
3.73 MB
Format:
Adobe Portable Document Format
Carregant...
Miniatura
Nom:
Code.zip
Mida:
6.98 MB
Format:
ZIP file