Carregant...
Tipus de document
ArticleVersió
Versió acceptadaData de publicació
Tots els drets reservats
Si us plau utilitzeu sempre aquest identificador per citar o enllaçar aquest document: https://hdl.handle.net/2445/182546
Corpora compilation for prosody-informed speech processing
Títol de la revista
Director/Tutor
ISSN de la revista
Títol del volum
Recurs relacionat
Resum
Research on speech technologies necessitates spoken data, which is usually obtained through read recorded speech, and specifically adapted to the research needs. When the aim is to deal with the prosody involved in speech, the available data must reflect natural and conversational speech, which is usually costly and difficult to get. This paper presents a machine learning-oriented toolkit for collecting, handling, and visualization of speech data, using prosodic heuristic. We present two corpora resulting from these methodologies: PANTED corpus, containing 250 h of English speech from TED Talks, and Heroes corpus containing 8 h of parallel English and Spanish movie speech. We demonstrate their use in two deep learning-based applications: punctuation restoration and machine translation. The presented corpora are freely available to the research community.
Matèries (anglès)
Citació
Citació
ÖKTEM, Alp, FARRÚS, Mireia, BONAFONTE, Antonio. Corpora compilation for prosody-informed speech processing. _Language Resources And Evaluation_. 2021. Vol. 55, núm. 4, pàgs. 925-946. [consulta: 15 de gener de 2026]. ISSN: 1574-020X. [Disponible a: https://hdl.handle.net/2445/182546]