Please use this identifier to cite or link to this item: http://hdl.handle.net/2445/130028
Title: Discurse Marker Characterisation Via Clustering: Extrapolation from Supervised to Unsupervised Corpora
Author: Alonso, Laura
Castellón Masalles, Irene
Padró, Lluís
Gibert, Karina
Keywords: Tractament del llenguatge natural (Informàtica)
Marcadors del discurs
Natural language processing (Computer science)
Discourse markers
Issue Date: 2002
Publisher: Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN)
Abstract: In this paper we will show how clustering techniques provide empirical evidence for a characterisation of Discourse Markers (DMs) that helps in overcoming the lack of consensus and reduces the cost of building NLP resources based on DMs. By comparison of classifications from hand-tagged and unsupervised corpora we are capable of grounding a notion of DM prototypicality, from which reliable classifications can be obtained from fully unsupervised corpora.
Note: Reproducció del document publicat a: http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/3257
It is part of: Procesamiento del lenguaje natural , 2002, num. 29, p. 223-230
URI: http://hdl.handle.net/2445/130028
ISSN: 1135-5948
Appears in Collections:Articles publicats en revistes (Filologia Catalana i Lingüística General)

Files in This Item:
File Description SizeFormat 
514597.pdf1.08 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.