Cortés Martínez, JordiFernández Martínez, DanielChen, Xinnuo2026-02-042026-02-042025https://hdl.handle.net/2445/226616Treballs Finals de Grau en Estadística UB-UPC, Facultat d'Economia i Empresa (UB) i Facultat de Matemàtiques i Estadística (UPC), Curs: 2024-2025, Tutors: Jordi Cortés Martínez ; Daniel Fernández MartínezIn the context of the constant growth of artificial intelligence, the requirement for large volumes of data has become one of the main challenges. Using synthetic data is a viable alternative for addressing both the scarcity of real data and the need to protect information privacy. For synthetic data to be useful, it is essential to validate that the characteristics of the original data are preserved. This study analyses the reliability of the SPECKS metric for measuring similarity between real and synthetic data in cluster analysis. Several factors affecting the ability of algorithms to repli cate the structure of the original clusters are examined through simulations. The relationship between SPECKS and clustering metrics that allow the similarity of the clusters’ structure to be evaluated is also studied to determine whether SPECKS can be a good indicator of the quality of structural preservation in synthetic data clusters. The results suggest that SPECKS is insensitive to structural changes and is therefore not a suitable metric for evaluating structural quality in cluster analysis.51 p.application/pdfengcc-by-nc-nd (c) Chen, 2025http://creativecommons.org/licenses/by-nc-nd/4.0/Intel·ligència artificialMètodes de simulacióComputació distribuïdaTreballs de fi de grauArtificial intelligenceSimulation methodsComputational grids (Computer systems)Bachelor's thesesAssessment of the Resemblance Metrics for Synthetic data validationinfo:eu-repo/semantics/bachelorThesisinfo:eu-repo/semantics/openAccess