Abstract
This article presents an overview of the technological components used in the process of audio description, and suggests a new scenario in which speech recognition, machine translation, and text-to-speech, with the corresponding human revision, could be used to increase audio description provision. The article focuses on a process in which both speaker diarization and speech recognition are used in order to obtain a semi-automatic transcription of the audio description track. The technical process is presented and experimental results are summarized.
Cite
Delgado, H., Matamala, A., & Serrano, J. (2015). Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities? Cadernos de Tradução, 35(2), 308. https://doi.org/10.5007/2175-7968.2015v35n2p308