Neural Network Strategies and Models for Voice Cloning in a Multi-speaker Mode: An Overview

0Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The evolution of data science and the constant challenge of carrying out different processes using a few resources with simultaneous personalization has promoted interest in the development of voice cloning. Nowadays, different machine learning techniques are used, given their efficiency in generating relationships across multiple parameters. In this regard, we evaluated the best-performing models and the different process optimization strategies within this sector, where through neural network models separated modularly by their functionality, it is possible to generate independent processes taking into account the most significant number of linguistic factors in the generation of the voice, thus obtaining significant results of a clear improvement in the whole process of synthesizing the voice of a target speaker.

Cite

CITATION STYLE

APA

Fura-Mendoza, M., Moscol-Albañil, I., Rodriguez, C., Lezama, P., Rodriguez, D., & Pomachagua, Y. (2023). Neural Network Strategies and Models for Voice Cloning in a Multi-speaker Mode: An Overview. In Lecture Notes in Networks and Systems (Vol. 685 LNNS, pp. 229–237). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-99-1912-3_21

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free