The Information Structure-prosody interface in text-to-speech technologies. An empirical perspective

dc.contributor.author: Domínguez, Mónica
dc.contributor.author: Farrús, Mireia
dc.contributor.author: Wanner, Leo
dc.date.accessioned: 2022-02-11T14:35:31Z
dc.date.available: 2022-02-11T14:35:31Z
dc.date.issued: 2022
dc.date.updated: 2022-02-11T14:35:32Z
dc.description.abstract: The correspondence between a speaker's communicative intention in terms of Information Structure and the way the speaker conveys that intention through prosody has been a fruitful field of study in Linguistics. However, text-to-speech applications still lack the variability and richness with which humans display their communication skills in speech. Some attempts were made in the past to model one aspect of Information Structure, namely thematicity, for application to intonation generation in text-to-speech technologies. Yet these applications suffer from two limitations: (i) they draw upon a small number of made-up simple question-answer pairs rather than on real (spoken or written) corpus material; and (ii) they do not explore whether any other interpretation would better suit a wider range of textual genres beyond dialogs. In this paper, two different interpretations of thematicity in the field of speech technologies are examined: the state-of-the-art binary (and flat) theme-rheme, and the hierarchical thematicity defined by Igor Mel'čuk within the Meaning-Text Theory. The outcome of the experiments on a corpus of native speakers of US English suggests that the latter interpretation of thematicity has a versatile implementation potential for text-to-speech applications of the Information Structure-prosody interface.
dc.format.mimetype: application/pdf
dc.identifier.idgrec: 706552
dc.identifier.issn: 1613-7027
dc.identifier.uri: https://hdl.handle.net/2445/183091
dc.language.iso: eng
dc.publisher: De Gruyter Mouton
dc.relation.isformatof: Reproduction of the document published at: https://doi.org/10.1515/cllt-2020-0008
dc.relation.ispartof: Corpus Linguistics and Linguistic Theory, 2022, vol. 18, num. 2, p. 419-445
dc.relation.projectID: info:eu-repo/grantAgreement/EC/H2020/870930/EU//WELCOME
dc.relation.projectID: info:eu-repo/grantAgreement/EC/H2020/645012/EU//KRISTINA
dc.relation.uri: https://doi.org/10.1515/cllt-2020-0008
dc.rights: (c) Domínguez, Mónica et al., 2022
dc.rights.accessRights: info:eu-repo/semantics/openAccess
dc.source: Articles publicats en revistes (Filologia Catalana i Lingüística General)
dc.subject.classification: Lingüística computacional
dc.subject.classification: Anàlisi prosòdica (Lingüística)
dc.subject.classification: Entonació (Fonètica)
dc.subject.classification: Tema i rema
dc.subject.classification: Corpus (Lingüística)
dc.subject.other: Computational linguistics
dc.subject.other: Prosodic analysis (Linguistics)
dc.subject.other: Intonation (Phonetics)
dc.subject.other: Topic and comment
dc.subject.other: Corpora (Linguistics)
dc.title: The Information Structure-prosody interface in text-to-speech technologies. An empirical perspective
dc.type: info:eu-repo/semantics/article
dc.type: info:eu-repo/semantics/publishedVersion

Files

Original bundle

Name: 706552.pdf
Size: 814.96 KB
Format: Adobe Portable Document Format