Análisis de la riqueza léxica en el contexto de la clasificación de atributos demográficos latentes

Please use this identifier to cite or link to this item: http://hdl.handle.net/2445/130340

Title:	Análisis de la riqueza léxica en el contexto de la clasificación de atributos demográficos latentes [Analysis of lexical richness in the context of classifying latent demographic attributes]
Author:	Roberto Rodríguez, John Alexander; Martí Antonin, M. Antònia; Salamó Llorente, Maria
Keywords:	Tractament del llenguatge natural (Informàtica); Lexicologia; Natural language processing (Computer science); Lexicology
Issue Date:	1-Jun-2012
Publisher:	Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN)
Abstract:	In this paper we analyse the utility of lexical richness measures for predicting latent user attributes from opinionated texts in Spanish. Our aim is to determine how useful lexical richness is for predicting a user's gender, age and regional origin. To this end, we applied 32 lexical richness measures to 1911 texts previously labeled with demographic information. This approach has the advantage of being domain-independent and having a modest computational cost.
Note:	Reproduction of the document published at: http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/4493
It is part of:	Procesamiento del lenguaje natural, 2012, no. 48, pp. 97-104
URI:	http://hdl.handle.net/2445/130340
ISSN:	1135-5948
Appears in Collections:	Articles publicats en revistes (Matemàtiques i Informàtica); Articles publicats en revistes (Filologia Catalana i Lingüística General)

Files in This Item:

File	Description	Size	Format
611158.pdf		791.57 kB	Adobe PDF