Vitrià i Marca, JordiLambrou, Theodoros2025-09-192025-09-192025-06-30https://hdl.handle.net/2445/223273Treballs finals del Màster de Fonaments de Ciència de Dades, Facultat de matemàtiques, Universitat de Barcelona. Any: 2025. Tutor: Jordi Vitrià i MarcaAccurately forecasting traffic incident severity is crucial for urban mobility planning and real-time traffic management. This thesis explores a hybrid approach to classifying traffic severity levels using statistical and machine learning techniques. The dataset includes road segment-level hourly traffic observations in London, enriched with engineered features such as recent severity history, weather conditions, and baseline severity probabilities. We evaluate a range of models, from simple baselines to advanced classifiers, with a focus on Random Forest and XGBoost. After extensive experimentation, a tuned Random Forest model using balanced subsampling and moderate tree depth outperformed all other approaches in terms of macro-averaged F1-score and minority class recall. Detailed evaluation through time-based cross-validation, SHAP analysis, and visual diagnostics demonstrates the robustness of this model and highlights key predictive factors. The findings suggest that combining short-term temporal features with baseline statistical probabilities significantly improves performance, particularly for under-represented severity classes. The report also discusses limitations related to data coverage, class imbalance, and the potential of incorporating external signals such as incidents or public transport disruptions in future work. The corresponding python notebooks, scripts and data for this thesis are located in this GitHub repository: https://github.com/theol-10/datascience-thesis/.33 p.application/pdfengcc-by-nc-nd (c) Theodoros Lambrou, 2025cc-by-nc-nd (c) Theodoros Lambrou, 2025http://creativecommons.org/licenses/by-nc-nd/3.0/es/http://www.gnu.org/licenses/gpl-3.0.ca.htmlCirculació urbanaAprenentatge automàticProbabilitatsTreballs de fi de màsterSistemes classificadors (Intel·ligència artificial)Urban trafficMachine learningProbabilitiesMaster's thesisLearning classifier systemsForecasting Urban Traffic Patterns in London Using Hybrid AI Techniquesinfo:eu-repo/semantics/bachelorThesisinfo:eu-repo/semantics/openAccess