Analysis and Assessment of Controllability of an Expressive Deep Learning-Based TTS System

Noé Tits; Kevin El Haddad; Thierry Dutoit

Journal Article (Open Access)

Informatics (2021) 8(4)

DOI: 10.3390/informatics8040084

Abstract

In this paper, we study the controllability of an expressive TTS system trained on a dataset for continuous control. The dataset is the Blizzard 2013 dataset, based on audiobooks read by a female speaker and containing great variability in style and expressiveness. Controllability is evaluated with both an objective and a subjective experiment. The objective assessment is based on a measure of correlation between acoustic features and the dimensions of the latent space representing expressiveness. The subjective assessment is based on a perceptual experiment in which users are shown an interface for controllable expressive TTS and asked to retrieve a synthetic utterance whose expressiveness subjectively corresponds to that of a reference utterance.
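
The objective assessment described in the abstract can be illustrated with a minimal sketch. The abstract only says "a measure of correlation", so the use of the Pearson coefficient here is an assumption, as are the function name `latent_feature_correlation`, the feature names, and the synthetic data, none of which come from the paper.

```python
import numpy as np

def latent_feature_correlation(latents, features):
    """Pearson correlation between each latent dimension and each acoustic feature.

    latents:  array of shape (n_utterances, n_latent_dims)
    features: array of shape (n_utterances, n_features)
    Returns an array of shape (n_latent_dims, n_features).
    Assumption: Pearson is used; the paper's exact measure may differ.
    """
    latents = np.asarray(latents, dtype=float)
    features = np.asarray(features, dtype=float)
    # Standardize each column to zero mean and unit variance.
    z_lat = (latents - latents.mean(axis=0)) / latents.std(axis=0)
    z_feat = (features - features.mean(axis=0)) / features.std(axis=0)
    # The mean product of standardized columns is the Pearson coefficient.
    return z_lat.T @ z_feat / latents.shape[0]

# Synthetic stand-in data (hypothetical, not from the paper).
rng = np.random.default_rng(0)
latents = rng.normal(size=(200, 2))
mean_f0 = 3.0 * latents[:, 0] + 0.1 * rng.normal(size=200)  # driven by latent dim 0
energy = rng.normal(size=200)                               # unrelated to the latents
features = np.column_stack([mean_f0, energy])

corr = latent_feature_correlation(latents, features)
print(np.round(corr, 2))
```

In this toy setup, a large absolute value in row i, column j would suggest that moving along latent dimension i changes acoustic feature j in a predictable way, which is the sense of controllability the objective experiment targets.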

Citation (APA)

Tits, N., El Haddad, K., & Dutoit, T. (2021). Analysis and assessment of controllability of an expressive deep learning-based TTS system. Informatics, 8(4). https://doi.org/10.3390/informatics8040084
