Discourse Marker Characterisation Via Clustering: Extrapolation from Supervised to Unsupervised Corpora

dc.contributor.author: Alonso, Laura
dc.contributor.author: Castellón Masalles, Irene
dc.contributor.author: Padró, Lluís
dc.contributor.author: Gibert, Karina
dc.date.accessioned: 2019-03-11T15:10:09Z
dc.date.available: 2019-03-11T15:10:09Z
dc.date.issued: 2002
dc.date.updated: 2019-03-11T15:10:10Z
dc.description.abstract: In this paper we will show how clustering techniques provide empirical evidence for a characterisation of Discourse Markers (DMs) that helps in overcoming the lack of consensus and reduces the cost of building NLP resources based on DMs. By comparison of classifications from hand-tagged and unsupervised corpora we are capable of grounding a notion of DM prototypicality, from which reliable classifications can be obtained from fully unsupervised corpora.
dc.format.extent: 8 p.
dc.format.mimetype: application/pdf
dc.identifier.idgrec: 514597
dc.identifier.issn: 1135-5948
dc.identifier.uri: https://hdl.handle.net/2445/130028
dc.language.iso: eng
dc.publisher: Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN)
dc.relation.isformatof: Reproduction of the document published at: http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/3257
dc.relation.ispartof: Procesamiento del lenguaje natural, 2002, num. 29, p. 223-230
dc.rights: (c) Alonso, Laura et al., 2002
dc.rights.accessRights: info:eu-repo/semantics/openAccess
dc.source: Articles published in journals (Filologia Catalana i Lingüística General)
dc.subject.classification: Tractament del llenguatge natural (Informàtica)
dc.subject.classification: Marcadors del discurs
dc.subject.other: Natural language processing (Computer science)
dc.subject.other: Discourse markers
dc.title: Discourse Marker Characterisation Via Clustering: Extrapolation from Supervised to Unsupervised Corpora
dc.type: info:eu-repo/semantics/article
dc.type: info:eu-repo/semantics/publishedVersion

Files

Original bundle

Name: 514597.pdf
Size: 1.06 MB
Format: Adobe Portable Document Format