Comparing distributional semantic models for identifying groups of semantically related words

dc.contributor.authorKovatchev, Venelin
dc.contributor.authorSalamó Llorente, Maria
dc.contributor.authorMartí Antonin, M. Antònia
dc.date.accessioned2019-02-22T15:15:48Z
dc.date.available2019-02-22T15:15:48Z
dc.date.issued2016-09-15
dc.date.updated2019-02-22T15:15:48Z
dc.description.abstractDistributional Semantic Models (DSM) are growing in popularity in Computational Linguistics. DSM use corpora of language use to automatically induce formal representations of word meaning. This article focuses on one of the applications of DSM: identifying groups of semantically related words. We compare two models for obtaining formal representations: a well known approach (CLUTO) and a more recently introduced one (Word2Vec). We compare the two models with respect to the PoS coherence and the semantic relatedness of the words within the obtained groups. We also proposed a way to improve the results obtained by Word2Vec through corpus preprocessing. The results show that: a) CLUTO outperformsWord2Vec in both criteria for corpora of medium size; b) The preprocessing largely improves the results for Word2Vec with respect to both criteria.
dc.format.extent8 p.
dc.format.mimetypeapplication/pdf
dc.identifier.idgrec666326
dc.identifier.issn1135-5948
dc.identifier.urihttps://hdl.handle.net/2445/128718
dc.language.isoeng
dc.publisherSociedad Española para el Procesamiento del Lenguaje Natural (SEPLN)
dc.relation.isformatofReproducció del document publicat a: http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/5343
dc.relation.ispartofProcesamiento del lenguaje natural , 2016, num. 57, p. 109-116
dc.rights(c) Kovatchev, Venelin et al., 2016
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.sourceArticles publicats en revistes (Filologia Catalana i Lingüística General)
dc.subject.classificationTractament del llenguatge natural (Informàtica)
dc.subject.classificationSemàntica
dc.subject.otherNatural language processing (Computer science)
dc.subject.otherSemantics
dc.titleComparing distributional semantic models for identifying groups of semantically related words
dc.typeinfo:eu-repo/semantics/article
dc.typeinfo:eu-repo/semantics/publishedVersion

Fitxers

Paquet original

Mostrant 1 - 1 de 1
Carregant...
Miniatura
Nom:
666326.pdf
Mida:
263.86 KB
Format:
Adobe Portable Document Format