Please use this identifier to cite or link to this item: http://hdl.handle.net/2445/171321
Title: DISCOver: DIStributional approach based on syntactic dependencies for discovering COnstructions
Author: Martí Antonin, M. Antònia
Taulé Delor, Mariona
Kovatchev, Venelin
Salamó Llorente, Maria
Keywords: Gramàtica cognitiva
Models lingüístics
Semàntica
Cognitive grammar
Linguistic models
Semantics
Issue Date: 4-Jan-2019
Publisher: De Gruyter Mouton
Abstract: One of the goals in Cognitive Linguistics is the automatic identification and analysis of constructions, since they are fundamental linguistic units for understanding language. This article presents DISCOver, an unsupervised methodology for the automatic discovery of lexico-syntactic patterns that can be considered as candidates for constructions. This methodology follows a distributional semantic approach. Concretely, it is based on our proposed pattern-construction hypothesis: those contexts that are relevant to the definition of a cluster of semantically related words tend to be (part of) lexico-syntactic constructions. Our proposal uses Distributional Semantic Models for modelling the context taking into account syntactic dependencies. After a clustering process, we linked all those clusters with strong relationships and we use them as a source of information for deriving lexico-syntactic patterns, obtaining a total number of 220,732 candidates from a 100 million token corpus of Spanish. We evaluated the patterns obtained intrinsically, applying statistical association measures and they were also evaluated qualitatively by experts. Our results were superior to the baseline in both quality and quantity in all cases. While our experiments have been carried out using a Spanish corpus, this methodology is language independent and only requires a large corpus annotated with the parts of speech and dependencies to be applied.
Note: Reproducció del document publicat a: https://doi.org/10.1515/cllt-2018-0028
It is part of: Corpus Linguistics and Linguistic Theory, 2019
URI: http://hdl.handle.net/2445/171321
Related resource: https://doi.org/10.1515/cllt-2018-0028
ISSN: 1613-7027
Appears in Collections:Articles publicats en revistes (Filologia Catalana i Lingüística General)

Files in This Item:
File Description SizeFormat 
683887.pdf555.64 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.