Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities?

Héctor Delgado; Anna Matamala; Javier Serrano

Journal ArticleOPEN ACCESS

Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities?

Delgado H
Matamala A
Serrano J

Cadernos de Tradução (2015) 35(2) 308

DOI: 10.5007/2175-7968.2015v35n2p308

N/ACitations

12Readers

Abstract

This article presents an overview of the technological components usedin the process of audio description, and suggests a new scenario inwhich speech recognition, machine translation, and text-to-speech, withthe corresponding human revision, could be used to increase audiodescription provision. The article focuses on a process in which bothspeaker diarization and speech recognition are used in order to obtain asemi-automatic transcription of the audio description track. Thetechnical process is presented and experimental results are summarized.

Cite

CITATION STYLE

APA

Delgado, H., Matamala, A., & Serrano, J. (2015). Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities? Cadernos de Tradução, 35(2), 308. https://doi.org/10.5007/2175-7968.2015v35n2p308

Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities?

Abstract

Cite

Register to see more suggestions