Corpora compilation for prosody-informed speech processing

dc.contributor.author: Öktem, Alp
dc.contributor.author: Farrús, Mireia
dc.contributor.author: Bonafonte, Antonio
dc.date.accessioned: 2022-01-21T16:04:48Z
dc.date.available: 2022-01-21T16:04:48Z
dc.date.issued: 2021-12
dc.date.updated: 2022-01-21T16:04:48Z
dc.description.abstract: Research on speech technologies requires spoken data, which is usually obtained from recordings of read speech and adapted specifically to the research needs. When the aim is to deal with the prosody of speech, the data must reflect natural, conversational speech, which is usually costly and difficult to obtain. This paper presents a machine learning-oriented toolkit for collecting, handling, and visualizing speech data using prosodic heuristics. We present two corpora resulting from these methodologies: the PANTED corpus, containing 250 h of English speech from TED Talks, and the Heroes corpus, containing 8 h of parallel English and Spanish movie speech. We demonstrate their use in two deep learning-based applications: punctuation restoration and machine translation. The presented corpora are freely available to the research community.
dc.format.extent: 22 p.
dc.format.mimetype: application/pdf
dc.identifier.idgrec: 713986
dc.identifier.issn: 1574-020X
dc.identifier.uri: https://hdl.handle.net/2445/182546
dc.language.iso: eng
dc.publisher: Springer Verlag
dc.relation.isformatof: Postprint version of the document published at: https://doi.org/10.1007/s10579-021-09556-2
dc.relation.ispartof: Language Resources and Evaluation, 2021, vol. 55, num. 4, p. 925-946
dc.relation.uri: https://doi.org/10.1007/s10579-021-09556-2
dc.rights: (c) Springer Verlag, 2021
dc.rights.accessRights: info:eu-repo/semantics/openAccess
dc.source: Articles published in journals (Filologia Catalana i Lingüística General)
dc.subject.classification: Reconeixement automàtic de la parla
dc.subject.classification: Traducció automàtica
dc.subject.classification: Puntuació
dc.subject.classification: Corpus (Lingüística)
dc.subject.other: Automatic speech recognition
dc.subject.other: Machine translating
dc.subject.other: Punctuation
dc.subject.other: Corpora (Linguistics)
dc.title: Corpora compilation for prosody-informed speech processing
dc.type: info:eu-repo/semantics/article
dc.type: info:eu-repo/semantics/acceptedVersion

Files

Original bundle

Name: Öktem2021_Article_CorporaCompilationForProsody-i.pdf
Size: 2.5 MB
Format: Adobe Portable Document Format