Please use this identifier to cite or link to this item: http://hdl.handle.net/2445/182546
Title: Corpora compilation for prosody-informed speech processing
Author: Öktem, Alp
Farrús, Mireia
Bonafonte, Antonio
Keywords: Reconeixement automàtic de la parla
Traducció automàtica
Puntuació
Corpus (Lingüística)
Automatic speech recognition
Machine translating
Punctuation
Corpora (Linguistics)
Issue Date: Dec-2021
Publisher: Springer Verlag
Abstract: Research on speech technologies necessitates spoken data, which is usually obtained through read recorded speech, and specifically adapted to the research needs. When the aim is to deal with the prosody involved in speech, the available data must reflect natural and conversational speech, which is usually costly and difficult to get. This paper presents a machine learning-oriented toolkit for collecting, handling, and visualization of speech data, using prosodic heuristic. We present two corpora resulting from these methodologies: PANTED corpus, containing 250 h of English speech from TED Talks, and Heroes corpus containing 8 h of parallel English and Spanish movie speech. We demonstrate their use in two deep learning-based applications: punctuation restoration and machine translation. The presented corpora are freely available to the research community.
Note: Versió postprint del document publicat a: https://doi.org/10.1007/s10579-021-09556-2
It is part of: Language Resources And Evaluation, 2021, vol. 55, num. 4, p. 925-946
URI: http://hdl.handle.net/2445/182546
Related resource: https://doi.org/10.1007/s10579-021-09556-2
ISSN: 1574-020X
Appears in Collections:Articles publicats en revistes (Filologia Catalana i Lingüística General)

Files in This Item:
File Description SizeFormat 
Öktem2021_Article_CorporaCompilationForProsody-i.pdf2.56 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.