Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities?

  • Delgado H
  • Matamala A
  • Serrano J

Abstract

This article presents an overview of the technological components used in the process of audio description, and suggests a new scenario in which speech recognition, machine translation, and text-to-speech, with the corresponding human revision, could be used to increase audio description provision. The article focuses on a process in which both speaker diarization and speech recognition are used in order to obtain a semi-automatic transcription of the audio description track. The technical process is presented and experimental results are summarized.

Citation (APA)

Delgado, H., Matamala, A., & Serrano, J. (2015). Speaker diarization and speech recognition in the semi-automatization of audio description: An exploratory study on future possibilities? Cadernos de Tradução, 35(2), 308. https://doi.org/10.5007/2175-7968.2015v35n2p308
