Avui, dilluns 8 de juny, el Dipòsit Digital no estarà operatiu de 15:00 a 17:00 h per tasques de manteniment. Disculpeu les molèsties.
Hoy, lunes 8 de junio, el Dipòsit Digital no estará operativo de 15:00 a 17:00 h debido a tareas de mantenimiento. Disculpen las molestias.
Today, Monday, Jun 8th, the Digital Repository will be unavailable due to a system update.

Document type

Bachelor thesis

Publication date

Publication license

memòria: cc-nc-nd (c) Alejandro Alfonso Hernández, 2020
Please use this identifier to cite or link to this item: https://hdl.handle.net/2445/167099

Audio-visual deep learning regression of apparent personality

Journal Title

Journal ISSN

Volume Title

Related resource

Abstract

[en] Personality perception is based on the relationship of the human being with the individuals of his surroundings. This kind of perception allows to obtain conclusions based on the analysis and interpretation of the observable, mainly face expressions, tone of voice and other nonverbal signals, allowing the construction of an apparent personality (or first impression) of people. Apparent personality (or first impressions) are subjective, and subjectivity is an inherent property of perception based exclusively on the point of view of each individual. In this project, we approximate such subjectivity using a multi-modal deep neural network with audiovisual signals as input and a late fusion strategy of handcrafted features, achieving accurate results. The aim of this work is to perform an analysis of the influence of automatic prediction for apparent personality (based on the Big-Five model), of the following characteristics: raw audio, visual information (sequence of face images) and high-level features, including Ekman's universal basic emotions, gender and age. To this end, we have defined different modalities, performing combinations of them and determining how much they contribute to the regression of apparent personality traits. The most remarkable results obtained through the experiments performed are as follows: in all modalities, females have a higher average accuracy than men, except in the modality with only audio; for happy emotion, the best accuracy score is found in the Conscientiousness trait; Extraversion and Conscientiousness traits get the highest accuracy scores in almost all emotions; visual information is the one that most positively influences the results; the combination of high-level features chosen slightly improves the accuracy performance for predictions.

Description

Treballs Finals de Grau d'Enginyeria Informàtica, Facultat de Matemàtiques, Universitat de Barcelona, Any: 2020, Director: Sergio Escalera Guerrero, Cristina Palmero Cantariño i Julio Jacques Junior

Citation

Citation

ALFONSO HERNÁNDEZ, Alejandro. Audio-visual deep learning regression of apparent personality. [consulted: 8 of June of 2026]. Available at: https://hdl.handle.net/2445/167099

Export metadata

JSON - METS

Share record