Algorithmic Ventriloquism: The Contested State of Voice in AI Speech Generators

9Citations
Citations of this article
32Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

This article explores the vocal human–machine relations embedded in text-to-speech (TTS) generators. Retracing the human sources behind the synthetic speech and tracking the remediation of the voice by the machine-learning algorithm, it argues that artificial intelligence (AI) speaking agents such as Siri and Alexa, as well as other TTS acts such as TikTok’s, are performing algorithmic ventriloquism. Speaking mechanically with the voices of professional voiceover artists, AI speech technologies algorithmically manipulate these voices, thus generating personas that hold an interconnected chain of tensions between the embodied and the virtual, the particular and the general, the human and the non-human, as well as between speech and writing. Algorithmic ventriloquism serves as an analytical framework to tie the techno-vocalic operation of the TTS system with its cultural, economic, philosophical, and sociolinguistic predicaments. The last section discusses the implications of algorithmic ventriloquism beyond the realm of the voice.

Cite

CITATION STYLE

APA

Ramati, I. (2024). Algorithmic Ventriloquism: The Contested State of Voice in AI Speech Generators. Social Media and Society, 10(1). https://doi.org/10.1177/20563051231224401

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free