Iarg-AnCora: Spanish corpus annotated with implicit arguments

dc.contributor.author: Taulé Delor, Mariona
dc.contributor.author: Peris Morant, Aina
dc.contributor.author: Rodríguez Hontoria, Horacio
dc.date.accessioned: 2020-10-21T13:29:46Z
dc.date.available: 2020-10-21T13:29:46Z
dc.date.issued: 2016-09-01
dc.date.updated: 2020-10-21T13:29:47Z
dc.description.abstract: This article presents the Spanish Iarg-AnCora corpus (400k words, 13,883 sentences), annotated with the implicit arguments of deverbal nominalizations (18,397 occurrences). We describe the methodology used to create it, focusing on the annotation scheme and the criteria adopted. The corpus was manually annotated, and an inter-annotator agreement test was conducted (81% observed agreement) to ensure the reliability of the final resource. The annotation of implicit arguments yields a substantial gain in argument and thematic role coverage (128% on average). It is the first freely available, wide-coverage corpus annotated with implicit arguments for Spanish. The corpus can subsequently be used by machine learning-based semantic role labeling systems and for the linguistic analysis of implicit arguments grounded in real data. Semantic analyzers are essential components of current language technology applications, which need a deeper understanding of the text in order to make inferences at the highest level and obtain qualitative improvements in their results.
dc.format.extent: 28 p.
dc.format.mimetype: application/pdf
dc.identifier.idgrec: 669157
dc.identifier.issn: 1574-020X
dc.identifier.uri: https://hdl.handle.net/2445/171322
dc.language.iso: eng
dc.publisher: Springer Verlag
dc.relation.isformatof: Postprint version of the document published at: https://doi.org/10.1007/s10579-015-9334-3
dc.relation.ispartof: Language Resources and Evaluation, 2016, vol. 50, no. 3, pp. 549-584
dc.relation.uri: https://doi.org/10.1007/s10579-015-9334-3
dc.rights: (c) Springer Verlag, 2016
dc.rights.accessRights: info:eu-repo/semantics/openAccess
dc.source: Articles published in journals (Filologia Catalana i Lingüística General)
dc.subject.classification: Corpus (Lingüística)
dc.subject.classification: Semàntica
dc.subject.classification: Castellà (Llengua)
dc.subject.other: Corpora (Linguistics)
dc.subject.other: Semantics
dc.subject.other: Spanish language
dc.title: Iarg-AnCora: Spanish corpus annotated with implicit arguments
dc.type: info:eu-repo/semantics/article
dc.type: info:eu-repo/semantics/acceptedVersion

Files

Original bundle

Name: 669157.pdf
Size: 1.09 MB
Format: Adobe Portable Document Format