Document type

Article

Version

Published version

Publication date

Publication license

cc-by-sa (c) Rosado Rodrigo, Pilar et al., 2016
Please use this identifier to cite or link to this item: https://hdl.handle.net/2445/100830

Del píxel a las resonancias visuales: la imagen con voz propia

Journal Title

Director/Tutor

Journal ISSN

Volume Title

Related resource

Abstract

The objective of our research is to develop a series of computer vision programs to search for analogies in large datasets¿in this case, collections of images of abstract paintings¿ based solely on their visual content without textual annotation. We have programmed an algorithm based on a specific model of image description used in computer vision. This approach involves placing a regular grid over the image and selecting a pixel region around each node. Dense features computed over this regular grid with overlapping patches are used to represent the images. Analysing the distances between the whole set of image descriptors we are able to group them according to their similarity and each resulting group will determines what we call 'visual words'. This model is called Bag-of-Words representation Given the frequency with which each visual word occurs in each image, we apply the method pLSA (Probabilistic Latent Semantic Analysis), a statistical model that classifies fully automatically, without any textual annotation, images according to their formal patterns. In

Citation

Citation

ROSADO RODRIGO, Pilar, FIGUERAS FERRER, Eva and REVERTER COMES, Ferran. Del píxel a las resonancias visuales: la imagen con voz propia. AusArt. Journal for Research in Art. 2016. Vol. 4, num. 1, pags. 19-28. ISSN 2340-8510. [consulted: 10 of June of 2026]. Available at: https://hdl.handle.net/2445/100830

Export metadata

JSON - METS

Share record