Please use this identifier to cite or link to this item:
https://hdl.handle.net/2445/130028| Title: | Discurse Marker Characterisation Via Clustering: Extrapolation from Supervised to Unsupervised Corpora |
| Author: | Alonso, Laura Castellón Masalles, Irene Padró, Lluís Gibert, Karina |
| Keywords: | Tractament del llenguatge natural (Informàtica) Marcadors del discurs Natural language processing (Computer science) Discourse markers |
| Issue Date: | 2002 |
| Publisher: | Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN) |
| Abstract: | In this paper we will show how clustering techniques provide empirical evidence for a characterisation of Discourse Markers (DMs) that helps in overcoming the lack of consensus and reduces the cost of building NLP resources based on DMs. By comparison of classifications from hand-tagged and unsupervised corpora we are capable of grounding a notion of DM prototypicality, from which reliable classifications can be obtained from fully unsupervised corpora. |
| Note: | Reproducció del document publicat a: http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/3257 |
| It is part of: | Procesamiento del lenguaje natural , 2002, num. 29, p. 223-230 |
| URI: | https://hdl.handle.net/2445/130028 |
| ISSN: | 1135-5948 |
| Appears in Collections: | Articles publicats en revistes (Filologia Catalana i Lingüística General) |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| 514597.pdf | 1.08 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
