DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech

15Citations
Citations of this article
24Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The majority of current Text-to-Speech (TTS) datasets, which are collections of individual utterances, contain few conversational aspects. In this paper, we introduce DailyTalk, a high-quality conversational speech dataset designed for conversational TTS. We sampled, modified, and recorded 2,541 dialogues from the open-domain dialogue dataset DailyDialog inheriting its annotated attributes. On top of our dataset, we extend prior work as our baseline, where a non-autoregressive TTS is conditioned on historical information in a dialogue. From the baseline experiment with both general and our novel metrics, we show that DailyTalk can be used as a general TTS dataset, and more than that, our baseline can represent contextual information from DailyTalk. The DailyTalk dataset and baseline code are freely available for academic use with CC-BY-SA 4.0 license1

Cite

CITATION STYLE

APA

Lee, K., Park, K., & Kim, D. (2023). DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (Vol. 2023-June). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICASSP49357.2023.10095751

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free