Neural Network Strategies and Models for Voice Cloning in a Multi-speaker Mode: An Overview

Marco Fura-Mendoza; Isabel Moscol-Albañil; Ciro Rodriguez; Pedro Lezama; Diego Rodriguez; Yuri Pomachagua

Conference Proceedings

Neural Network Strategies and Models for Voice Cloning in a Multi-speaker Mode: An Overview

Lecture Notes in Networks and Systems (2023) 685 LNNS 229-237

DOI: 10.1007/978-981-99-1912-3_21

0Citations

2Readers

Get full text

Abstract

The evolution of data science and the constant challenge of carrying out different processes using a few resources with simultaneous personalization has promoted interest in the development of voice cloning. Nowadays, different machine learning techniques are used, given their efficiency in generating relationships across multiple parameters. In this regard, we evaluated the best-performing models and the different process optimization strategies within this sector, where through neural network models separated modularly by their functionality, it is possible to generate independent processes taking into account the most significant number of linguistic factors in the generation of the voice, thus obtaining significant results of a clear improvement in the whole process of synthesizing the voice of a target speaker.

Author supplied keywords

Cite

CITATION STYLE

APA

Fura-Mendoza, M., Moscol-Albañil, I., Rodriguez, C., Lezama, P., Rodriguez, D., & Pomachagua, Y. (2023). Neural Network Strategies and Models for Voice Cloning in a Multi-speaker Mode: An Overview. In Lecture Notes in Networks and Systems (Vol. 685 LNNS, pp. 229–237). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-99-1912-3_21

Neural Network Strategies and Models for Voice Cloning in a Multi-speaker Mode: An Overview

Abstract

Author supplied keywords

Cite

Register to see more suggestions