Please use this identifier to cite or link to this item:
http://hdl.handle.net/2445/130028
Title: | Discurse Marker Characterisation Via Clustering: Extrapolation from Supervised to Unsupervised Corpora |
Author: | Alonso, Laura Castellón Masalles, Irene Padró, Lluís Gibert, Karina |
Keywords: | Tractament del llenguatge natural (Informàtica) Marcadors del discurs Natural language processing (Computer science) Discourse markers |
Issue Date: | 2002 |
Publisher: | Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN) |
Abstract: | In this paper we will show how clustering techniques provide empirical evidence for a characterisation of Discourse Markers (DMs) that helps in overcoming the lack of consensus and reduces the cost of building NLP resources based on DMs. By comparison of classifications from hand-tagged and unsupervised corpora we are capable of grounding a notion of DM prototypicality, from which reliable classifications can be obtained from fully unsupervised corpora. |
Note: | Reproducció del document publicat a: http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/3257 |
It is part of: | Procesamiento del lenguaje natural , 2002, num. 29, p. 223-230 |
URI: | http://hdl.handle.net/2445/130028 |
ISSN: | 1135-5948 |
Appears in Collections: | Articles publicats en revistes (Filologia Catalana i Lingüística General) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
514597.pdf | 1.08 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.