DNN-based speech synthesis: Importance of input features and training data

Alexandros Lazaridis; Blaise Potard; Philip N. Garner

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015), vol. 9319, pp. 193–200

DOI: 10.1007/978-3-319-23132-7_24

Abstract

Deep neural networks (DNNs) have recently been introduced in speech synthesis. In this paper, an investigation into the importance of input features and training data for speaker-dependent (SD) DNN-based speech synthesis is presented. Various aspects of the DNN training procedure are investigated in this work. Additionally, several training sets of different sizes (i.e., 13.5, 3.6 and 1.5 h of speech) are evaluated.

Cite

Lazaridis, A., Potard, B., & Garner, P. N. (2015). DNN-based speech synthesis: Importance of input features and training data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9319, pp. 193–200). Springer Verlag. https://doi.org/10.1007/978-3-319-23132-7_24
