Please use this identifier to cite or link to this item:
http://hdl.handle.net/2445/128718
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Kovatchev, Venelin | - |
dc.contributor.author | Salamó Llorente, Maria | - |
dc.contributor.author | Martí Antonin, M. Antònia | - |
dc.date.accessioned | 2019-02-22T15:15:48Z | - |
dc.date.available | 2019-02-22T15:15:48Z | - |
dc.date.issued | 2016-09-15 | - |
dc.identifier.issn | 1135-5948 | - |
dc.identifier.uri | http://hdl.handle.net/2445/128718 | - |
dc.description.abstract | Distributional Semantic Models (DSM) are growing in popularity in Computational Linguistics. DSM use corpora of language use to automatically induce formal representations of word meaning. This article focuses on one of the applications of DSM: identifying groups of semantically related words. We compare two models for obtaining formal representations: a well-known approach (CLUTO) and a more recently introduced one (Word2Vec). We compare the two models with respect to the PoS coherence and the semantic relatedness of the words within the obtained groups. We also propose a way to improve the results obtained by Word2Vec through corpus preprocessing. The results show that: a) CLUTO outperforms Word2Vec in both criteria for corpora of medium size; b) the preprocessing largely improves the results for Word2Vec with respect to both criteria. | - |
dc.format.extent | 8 p. | - |
dc.format.mimetype | application/pdf | - |
dc.language.iso | eng | - |
dc.publisher | Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN) | - |
dc.relation.isformatof | Reproduction of the document published at: http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/5343 | - |
dc.relation.ispartof | Procesamiento del lenguaje natural, 2016, num. 57, p. 109-116 | - |
dc.rights | (c) Kovatchev, Venelin et al., 2016 | - |
dc.source | Articles publicats en revistes (Filologia Catalana i Lingüística General) | - |
dc.subject.classification | Tractament del llenguatge natural (Informàtica) | - |
dc.subject.classification | Semàntica | - |
dc.subject.other | Natural language processing (Computer science) | - |
dc.subject.other | Semantics | - |
dc.title | Comparing distributional semantic models for identifying groups of semantically related words | - |
dc.type | info:eu-repo/semantics/article | - |
dc.type | info:eu-repo/semantics/publishedVersion | - |
dc.identifier.idgrec | 666326 | - |
dc.date.updated | 2019-02-22T15:15:48Z | - |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | - |
Appears in Collections: | Articles publicats en revistes (Filologia Catalana i Lingüística General) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
666326.pdf | | 263.86 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.